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Foreword 



The fourth volume on Advances and Applications of Dezert-Smarandache Theory 
(DSmT) for information fusion collects theoretical and applied contributions of 
researchers working in different fields of applications and in mathematics. The 
contributions (see List of Articles published in this book, at the end of the volume) 
have been published or presented after disseminating the third volume (2009, http:// 
fs.gallup.unm.edu/DSmT-book3.pdf) in international conferences, seminars, 
workshops and journals. 

First Part of this book presents the theoretical advancement of DSmT, dealing with 
Belief functions, conditioning and deconditioning, Analytic Hierarchy Process, 
Decision Making, Multi-Criteria, evidence theory, combination rule, evidence distance, 
conflicting belief, sources of evidences with different importance and reliabilities, 
importance of sources, pignistic probability transformation, Qualitative reasoning 
under uncertainty, Imprecise belief structures, 2-Tuple linguistic label, Electre Tri 
Method, hierarchical proportional redistribution, basic belief assignment, subjective 
probability measure, neutrosophic logic, Evidence theory, outranking methods, 
Dempster- Shafer Theory, Bayes fusion rule, frequentist probability, mean square error, 
controlling factor, optimal assignment solution, data association, Transferable Belief 
Model, and others. 

More applications of DSmT have emerged in the past years since the apparition of 
the third book of DSmT 2009. Subsequently, the second part of this volume is about 
applications of DSmT in correlation with Electronic Support Measures, belief function, 
sensor networks, Ground Moving Target and Multiple target tracking, Vehicle-Born 
Improvised Explosive Device, Belief Interacting Multiple Model filter, seismic and 
acoustic sensor, Support Vector Machines, Alarm classification, ability of human 
visual system, Uncertainty Representation and Reasoning Evaluation Framework, 
Threat Assessment, Handwritten Signature Verification, Automatic Aircraft 
Recognition, Dynamic Data-Driven Application System, adjustment of secure 
communication trust analysis, and so on. 

Finally, the third part presents a List of References related with DSmT published or 
presented along the years since its inception in 2004, chronologically ordered. 

We want to thank all the contributors of this fourth volume for their research works 
and their interests in the development of DSmT. 
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We are grateful as well to other colleagues for encouraging us to edit a new 
volume, for sharing with us several ideas and for their questions and comments on 
DSmT through the years. We thank the International Society of Information Fusion 
(www.isif.org) for diffusing main research works related to information fusion 
(including DSmT) in the international fusion conferences series over the years. 

This book is dedicated to the memory of our good friends and colleagues Dr. Jean- 
Pierre Le Cadre, Prof. Pierre Valin (ISIF president 2006) and Prof. Darko Musicki 
(ISIF President 2008) who have always been very active in ISIF and in the 
organization of past fusion conferences. We will never forget them. 

Also, Florentin Smarandache is grateful to The University of New Mexico, U.S.A., 
that many times partially sponsored him to attend international conferences, workshops 
and seminars on Information Fusion, and Jean Dezert is grateful to the Department of 
Information Modeling and Processing (DTIM) at the French Aerospace Lab (Office 
National d’Etudes et de Recherches Aerospatiales), Palaiseau, France, for encouraging 
him to carry on this research and for its financial support. 

For the next volume, the authors are pleased to send their articles on DSmT to the 
editors: 

Prof. Florentin Smarandache (fsmarandache@gmail.com) 
Dr. Jean Dezert (jdezert@gmail.com). 

The Editors. 
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Non Bayesian Conditioning and Deconditioning 

Jean Dezert 
Florentin Smarandache 



Originally published as: Dezert J., Smarandache F.- Non Bayesian 
conditioning and deconditioning, in Proc. of International 
Workshop on Belief Functions, Brest, France, April 2-4, 2010, 
and reprinted with permission. 



Abstract — In this paper, we present a Non-Bayesian condition- 
ing rule for belief revision. This rule is truly Non-Bayesian in 
the sense that it doesn’t satisfy the common adopted principle 
that when a prior belief is Bayesian, after conditioning by X, 
Bel(X\X) must be equal to one. Our new conditioning rule for 
belief revision is based on the proportional conflict redistribution 
rule of combination developed in DSmT (Dezert-Smarandache 
Theory) which abandons Bayes’ conditioning principle. Such 
Non-Bayesian conditioning allows to take into account judiciously 
the level of conflict between the prior belief available and 
the conditional evidence. We also introduce the deconditioning 
problem and show that this problem admits a unique solution 
in the case of Bayesian prior; a solution which is not possible 
to obtain when classical Shafer and Bayes conditioning rules are 
used. Several simple examples are also presented to compare 
the results between this new Non-Bayesian conditioning and the 
classical one. 

Keywords: Belief functions, conditioning, deconditioning, 
probability, DST, DSmT, Bayes rule. 

I. Introduction 

The question of the updating of probabilities and beliefs 
has yielded, and still yields, passionate philosophical and 
mathematical debates [3], [6], [7], [9], [12], [13], [17], [20], 
[22] in the scientific community and it arises from the 
different interpretations of probabilities. Such question has 
been reinforced by the emergence of the possibility and the 
evidence theories in the eighties [4], [16] for dealing with 
uncertain information. We cannot browse in details here all 
the different authors’ opinions [1], [2], [8], [10], [14], [15] 
on this important question but we suggest the reader to start 
with Dubois & Prade survey [5], In this paper, we propose a 
true Non-Bayesian rule of combination which doesn’t satisfy 
the well-adopted Bayes principle stating that P(X\X) = 1 
(or Bel(X\X) = 1 when working with belief functions). 
We show that by abandoning such Bayes principle, one can 
take into account more efficiently in the conditioning process 
the level of the existing conflict between the prior evidence 
and the new conditional evidence. We show also that the 
full deconditioning is possible in some specific cases. Our 
approach is based on belief functions and the Proportional 
Conflict Redistribution (mainly PCR5) rule of combination 
developed in Dezert-Smarandache Theory (DSmT) framework 

[18]. Why we use PCR5 here? Because PCR5 is very efficient 



to combine conflicting sources of evidences 1 and because 
Dempster’s rule often considered as a generalization of Bayes 
rule is actually not deconditionable (see examples in the 
sequel), contrariwise to PCR5, that’s why we utilize PCR5. 
This paper is organized as follows. In section II, we briefly 
recall Dempster’s rule of combination and Shafer’s Condition- 
ing Rule (SCR) proposed in Dempster-Shafer Theory (DST) 
of belief functions [16]. In section III, we introduce a new 
Non-Bayesian conditioning rule and show its difference with 
respect to SCR. In section IV, we introduce the dual problem, 
called the deconditioning problem. Some examples are given 
in section V with concluding remarks in section VI. 

II. Shafer’s conditioning rule 

In DST, a normalized basic belief assignment (bba) m(.) 
is defined as a mapping from the power set 2 e of the 
finite discrete frame of discernment 0 into [0, 1] such that 
m(0) = 0 and ^ Ye2 ©m(X) = 1- Belief and plausibility 
functions are in one-to-one correspondence with m(.) and are 
respectively defined by Bel(X) = yj Zg2 © zcx m (Z) and 
Pl(X) = ]C Zg2 © znx^o m (^)- They are usually interpreted 
as lower and upper bounds of a unknown measure of subjective 
probability P(.), i.e. Bel(X) < P(X) < Pl(X) for any X. In 
DST, the combination of two independent sources of evidence 
characterized by mi(.) and TO 2 (.) is done using Dempster’s 
rule as follows 2 : 

Ex 1 ,x 2 e 2 e mi(Xi)m 2 (X 2 ) 
m DS (X) = * inA ' a = A ' , y , CD 

1 Z*iX 1 .X 2 £2 0 TO l(Xl)m 2 (A 2 ) 

x 1 nx 2 =0) 

Shafer’s conditioning rule 3 (SCR) is obtained as the result 
of Dempster’s combination of the given prior bba m i(.) 
with the conditional evidence, say Y represented by a source 
m 2 (.) only focused on Y, that is such that m 2 (V) = 1. In 
other words, m(X\Y) = m.Ds(X) = (mi 0 m 2 )(X) using 
m 2 (V) = 1 and where 0 symbol denotes here Dempster’s 

1 Due to space limitation, we do not present, nor justify again PCR5 w.r.t. 
other rules since this has been widely explained in the literature with many 
examples and discussions, see for example [18], Vol. 2. and our web page. 

2 assuming that the numerator is not zero (the sources are not in total 
conflict). 

3 also called Dempster’s conditioning by Glenn Shafer in [16]. 
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fusion rule (1). It can be shown [16] that the conditional belief 
and the plausibility are given by 4 : 



Bel(X\Y) = m DS (Z\Y) 

ze 2° 
zcx 



Beh(X UK) — Beh(Y) 
1 - Beh(Y) 



Pl(X\Y)= ™ D s(Z\Y) = Pl pf ( Y ) ] 

z 62 ° Pll{Y) 

z nXjtQ 



( 2 ) 

(3) 



When the belief is Bayesian 5 , i.e. Bel(.\Y) = Pl(.\Y) = 
P(.\Y), SCR reduces to classical conditional probability def- 
inition (Bayes formula), that is P{X\Y) — P(X fl Y)/P(Y), 
with P(.) = mi(.). Note that when Y = X and as soon 
as Bel(X) < 1, one always gets from (2), Bel(X\X) = 1 
because Beh(XuY) = Beh{X U X) = Beh{Q) = 1. 
For Bayesian belief, this implies P{X\X) = 1 for any X 
such that Pi(X) > 0, which we call Bayes principle. Other 
alternatives have been proposed in the literature [8], [15], 
[21], but almost all of them satisfy Bayes principle and they 
are all somehow extensions/generalization of Bayes rule. A 
true Non-Bayesian conditioning (called weak conditioning) 
was however introduced by Planchet in 1989 in [14] but 
it didn’t bring sufficient interest because Bayes principle 
is generally considered as the best solution for probability 
updating based on different arguments for supporting such 
idea. Such considerations didn’t dissuade us to abandon Bayes 
principle and to explore new Non-Bayesian ways for belief 
updating, as Planchet did in nineties. We will show in next 
section why Non-Bayesian conditioning can be interesting. 



III. A Non Bayesian Conditioning Rule 



Before presenting our Non Bayesian Conditioning Rule, 
it is important to recall briefly the Proportional Conflict 
Redistribution Rule no. 5 (PCR5) which has been proposed 
as a serious alternative of Dempster’s rule [16] in Dezert- 
Smarandache Theory (DSmT) [18] for dealing with conflicting 
belief functions. In this paper, we assume working in the same 
fusion space as Glenn Shafer, i.e. on the power set 2 e of 
the finite frame of discernment 0 made of exhaustive and 
exclusive elements. 



A. PCR5 rule of combination 

Definition: Let’s m i(.) and W 2 (.) be two independent 6 bba’s, 
then the PCR5 rule of combination is defined as follows 
(see [18], Vol. 2 for details, justification and examples) when 
working in power set 2 e : m.pcRsi®) = 0 and \/X 6 2 e \ {0} 

tnpcRs(X) = ^ nri(Ai)m 2 (A 2 )-|- 

X 1 ,X 2 £2 0 

X!nx 2 =x 

y- mi{Xfm 2 {X 2 ) to 2 (A) 2 toi(A 2 ) . 

“ e mi(A) + to 2 (A 2 ) m 2 (X) + m 1 {X 2 ) 

X 2 £2^ 

A' 2 nA'— 0 

4 Y denotes the complement of Y in the frame 0. 

5 the focal elements of mi (.\Y) are singletons only. 

6 i.e. each source provides its bba independently of the other sources. 



All fractions in (4) having zero denominators are discarded. 
The extension and a variant of (4) (called PCR6) for 
combining s > 2 sources and for working in other fusion 
spaces is presented in details in [18]. Basically, in PCR5 the 
partial conflicting masses are redistributed proportionally to 
the masses of the elements which are involved in the partial 
conflict only, so that the specificity of the information is 
entirely preserved through this fusion process. It has been 
clearly shown in [18], Vol. 3, chap. 1 that Smets’ rule 7 is 
not so useful, nor cogent because it doesn’t respond to new 
information in a global or in a sequential fusion process. 
Indeed, very quickly Smets fusion result commits the full 
of mass of belief to the empty set!!! In applications, some 
ad-hoc numerical techniques must be used to circumvent this 
serious drawback. Such problem doesn’t occur with PCR5 
rule. By construction, other well-known rules like Dubois & 
Prade, or Yager’s rule, and contrariwise to PCR5, increase 
the non-specificity of the result. 

Properties of PCR5: 

• (P0): PCR5 rule is not associative, but it is quasi- 
associative (see [18], Vol. 2). 

• (PI): PCR5 Fusion of two non Bayesian bba’s is a non 
Bayesian bba. 

Example: Consider 0 = {A, B. C} with Shafer’s model 
and with the two non Bayesian bba’s toi(.) and m 2 (.) 
given in Table I. The PCR5 fusion result (rounded at the 
fourth decimal) is given in the right column of the Table 
I. One sees that m.pcR s(.) in a non Bayesian bba since 
some of its focal elements are not singletons. 



Table I 

PCR5 FUSION OF TWO NON BAYESIAN BBA’ S. 



Focal Elem. 


mi (.) 


m 2 (•) 




rn PCR 5(0 


A 


0.1 


0.2 




0.3850 


B 


0.2 


0.1 




0.1586 


C 


0.1 


0.2 




0.1990 


A U B 


0.3 


0 




0.0360 


AUC 


0 


0.5 




0.2214 


A U B U C 


0.3 


0 




0 



• (P2): PCR5 Fusion of a Bayesian bba with a non Bayesian 
bba is a non Bayesian bba in general 8 . 

Example: Consider 0 = {A, B , C} with Shafer’s model 
and Bayesian and a non Bayesian bba’s mi(.) and m 2 (.) 
to combine as given in Table II. The PCR5 fusion result 
is given in the right column of the Table II. One sees that 
mpcRbf) is a non Bayesian bba since some of its focal 
elements are not singletons. 

This property is in opposition with Dempster’s rule 
property (see Theorem 3.7 p. 67 in [16]) which states that 
if Bel\ is Bayesian and if Bel\ and Bel> are combinable, 
then Dempster’s rule provides always a Bayesian belief 
function. The result of Dempster’s rule noted mps{-) f° r 



7 i.e. the non normalized Dempster’s rule. 

8 In some cases, it happens that Bayesian ® Non-Bayesian = Bayesian. For 
example, with 0 = {A, B, C}, Shafer’s model, mi (A) = 0.3, m i (B) = 
0.7 and m 2 (A) = 0.1. m 2 (B) = 0.2, m 2 (C) = 0.4 and m 2 (AUB) = 0.3, 
one gets mpc 115 (A) = 0.2162, m P c 115 (B) = 0.6134 and rapcR 5 (C) = 
0.1704 which is a Bayesian bba. 
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Table II 

PCR5 fusion of Bayesian and Non Bayesian bba’s. 



Focal Elem. 


"M (•) 


m 2 ( • ) 


™r>.<7(0 


m PCR 5(0 


A 


0.1 


0 


0.0833 


0.0642 


B 


0.2 


0.3 


0.1000 


0.1941 


C 


0.7 


0.2 


0.8167 


0.6703 


AU C 


0 


0.5 


0 


0.0714 



this example is given in Table II for convenience. This is 
the major difference between PCR5 and Dempster’s rule, 
not to mention the management of conflicting information 
in the fusion process of course. 

In summary, and using ® symbol to denote the generic 
fusion process, one has 

- With Dempster’s rule : 

Bayesian ® Non-Bayesian = Bayesian 

- With PCR5 rule: 

Bayesian ® Non-Bayesian = Non-Bayesian (in general) 

• (P3): PCR5 Fusion of two Bayesian bba’s is a Bayesian 
bba (see [18], Vol. 2, pp. 43-45 for proof). 

Example: 0 = {A : B, C} with Shafer’s model and let’s 
consider Bayesian bba’s given in the next Table. The 
result of PCR5 fusion rule is given in the right column 
of Table III. One sees that mpcRt,{-) is Bayesian since 
its focal elements are singletons of the fusion space 2 e . 



Table III 

PCR5 FUSION OF TWO BAYESIAN BBA’S. 



Focal Elem. 


mi (.) 


mat.) 


m D .<?(-) 


m PC Rn(') 


A 


0.1 


0.4 


0.0870 


0.2037 


B 


0.2 


0 


0 


0.0567 


C 


0.7 


0.6 


0.9130 


0.7396 



B. A true Non Bayesian conditioning rule 

Here 9 we follow the footprints of Glenn Shafer in the sense 
that we consider the conditioning as the result of the fusion 
of any prior mass mi(.) defined on 2 e with the bba to 2 (.) 
focused on the conditional event F / I, i.e. m 2 {Y) = 1. 
We however replace Dempster’s rule by the more efficient 10 
Proportional Conflict Redistribution rule # 5 (PCR5) given by 
(4) proposed in DSmT [18], This new conditioning rule is 
not Bayesian and we use the symbol || (parallel) instead of 
classical symbol | to avoid confusion in notations. Let’s give 
the expression of m(X || Y ) resulting of the PCR5 fusion of 
any prior bba mi(.) with to 2 (.) focused on Y . Applying (4): 

m(X || Y) = S\ CR \X, Y) + S^ R5 {X, Y) + S^°{X, Y) (5) 

with 



Sr 5 (X,Y)± rn 1 (X 1 )m 2 (X 2 ) (6) 

Xi,X 2 €2 e 



9 More sophisticated conditioning rules have been proposed in [18], Vol. 2. 
10 It deals better with partial conflicts than other rules unlike Dempster’s 
rule, it does not increase the non-specificity of the result unlike Dubois & 
Prade or Yager’s rule, and it does respond to new information unlike Smets 
rule. 



Sr\X,Y)^m x {Xf Y, 



m 2 (X 2 



A 2 62 0 
AnA 2 =0 



mi(X) + m. 2 {X 2 ) 



Sr(X,Y)±m 2 (Xf Y 



m 1 (X 2 



m 2 (X) + rti\(X 2 ) 



( 7 ) 



( 8 ) 



,Y 2 62 w 
,Yn.Y 2 =0 

where m 2 (Y) = 1 for a given Y ^ 0. 

Since Y is the single focal element of m 2 (.), the term 
5J CR5 (X, Y) in (5) is given by ATe 2 e the term 

Xir\Y=x 

S^ CR5 (X,Y) equals S(X fl Y = 0) • , and the term 

S"- :R5 (X. Y) can be expressed depending on the value of X 
with respect to the conditioning term Y : 

• If X ^ Y then m 2 (X / 7) = 0 (by definition), and 
thus Sf R5 {X, Y) = 0. 

• If X = Y then m 2 (X = Y) = 1 (by definition), and 

thus Sg R5 (X, V) = E x 2 e 2 e 
A 2 nr=0 

Finally, S’J ?R5 (2f, Y) can be written as 



S R 3 CR5 (X, Y) = 6{X ± Y) • 0 +S(X = Y) Y 



= S(X = Y) ■ Y 



A' 2 e2 e 

A 2 nY=0 



A 2 G2 0 

A 2 nr=0 

m\(X 2 ) 

1 + m.i(X 2 ) 



to. \{X 2 ) 

1 + mi(X 2 ) 



Finally, m(X || Y) for X ^ 0 and 7^0 are given by 



t(X||F)= Y rn 1 (X 1 )+S(XnY = d).^^- 

E 



Ai€2 0 

ATnF=A 

+ S(X = Y) 



1 + mi(X) 

mi(X 2 ) 



A 2 G2 fc 



1 + TOi(X 2 ) 



(9) 



A' 2 nY —0 

to(0 || Y ^ 0) = 0 by definition, since PCR5 fusion doesn’t 
commit mass on the empty set. m(X || 0) is kept undefined 11 
since it doesn’t make sense to revise a bba by an impossible 
event. Based on the classical definitions of Bel{.) and Pl{.) 
functions [16], one has: 



Bel(X || Y) = Y m ( Z II y ) 



Z<E 

zcx 



Pl(X\\Y)= Y m ( Z W Y) 



( 10 ) 



( 11 ) 



ze 2 e 



znA'/0 

The ’’true” unknown (non Bayesian) conditional subjective 
probability, denoted P(X||Y), must satisfy 

Bel{X || Y) < P(A'||Y) < Pl(X\\Y) (12) 

n One could also define m(0 || 0) = 1 and m(X ^ 0 || 0) = 0 which 
however would not be a normal bba. 



13 





Advances and Applications of DSmT for Information Fusion. Collected Works. Volume 4 



P(X||Y) can be seen as an imprecise probability and used 
within IPT (Imprecise Probability Theory) [23] if necessary, 
or can be approximated from m(.||Y) using some probabilistic 
transforms, typically the pignistic transform [19] or the DSmP 
transform [18] (Vol.3, Chap. 3). The search for direct close- 
form expressions of Bel(X || Y) and Pl(X || Y) from 
Bel i(.) and Pli(.) appears to be an open difficult problem. 

IV. Deconditioning 

In the previous section we have proposed a new non 
Bayesian conditioning rule based on PCR5. This rule follows 
Shafer’s idea except that we use PCR5 instead of Dempster’s 
rule because we have shown the better efficiency of PCR5 
to deal with conflicting information w.r.t. other rules. In this 
section, we also show the great benefit of such PCR5 rule for 
the deconditioning problem. The belief conditioning problem 
consists in finding a way to update any prior belief function 
( Bel { .), Pl() or m(.)) with a new information related with the 
(belief of) occurrence in a given conditional proposition of the 
fusion space, say Y, in order to get a new belief function called 
conditional belief function. The deconditioning problem is the 
inverse (dual) problem of conditioning. It consists to retrieve 
the prior belief function from a given posterior/conditional 
belief function. Deconditioning has not been investigated in 
deep so far in the literature (to the knowledge of the authors) 
since is is usually considered as impossible to achieve 12 , 
it may present great interest for applications in advanced 
information systems when only a posterior belief is available 
(say provided by an human or an Al-expert system), but for 
some reason we need to compute a new conditioning belief 
based on a different conditional hypothesis. This motivates 
our research for developing deconditioning techniques. Since 
Bel{.), Pl() are in one-to-one correspondence with the basic 
belief assignment (bba) mass m(.), we focus our analysis on 
the deconditioning of the conditional bba. More simply stated, 
we want to see if for any given conditional bba m(.||Y) we 
can compute mi(.) such that m(.\\Y) = PCR 5 (mi(.),m. 2 {-)) 
with m^iY) = 1 and where PCR 5 (mi(.),m 2 (.)) denotes 
the PCR5 fusion of mi(.) with m 2 (.). Let’s examine the two 
distinct cases for the deconditiong problem depending on the 
(Bayesian or non-Bayesian) nature of the prior 

• Case of Bayesian prior mr(.): Let 0 = {(){ , 62 , . . . , 9 n }, 
with n > 2, Shafer’s model, where all 0, are singletons. 
Let mi : 0 i— > [0, 1] be a Bayesian bba/mass. In that case, 
the deconditioning problem admits a unique solution 
and we can always compute mi(.) from m(.||Y) but 
two distinct cases must be analyzed depending on the 
cardinality of the conditional term Y. 

Case 1: When Y is a singleton, i.e. |Yj = 1. Suppose 
m 2 (Y) = 1, with Y = 9j 0 , for j 0 € {l,2,...,n}, 
where jo is fixed. Since the bba’s mi(.) and TO 2 (.) are 
both Bayesian in this case, m(.\\Y) is also a Bayesian 
bba (property P3), therefore m(9i\\Y) = ai, where all 
a,: € [0,1] with E"=i a i = 1- How to find mi(.) such 

-"this truly happens when classical Bayes conditioning is used. 



that m(.||Y) = PCR 5 (mi(.),m 2 (-)) ? Let’s denote 
mi (0i) = Xi, where all Xi € [0,1] and J2i=i x i = L 
We need to find all these Xi. We now combine mr(.) 
with m 2 (■) using PCR5 fusion rule. We transfer Xi, for 
Vi ^ jo, to 9i and 9 J0 proportionally with respect to 
their corresponding masses, Xj and 1 respectively: = 



JO Xi 

1 Xi + 1 



whence wg i = 



— f-r and wg. = Xi . . , 

Xi~\~ 1 °3 0 Xi~\~ 1’ 



while aj 0 = Xj 0 - 



E Tl Xi 

.*= r T+T or a Jo 
. l Aj 0 



= !-£”= 



if jo 



Xi~\~ 1 * 



Since we need to find all unknowns x», i = 1 
we need to solve , = a,;, for i ^ jo for xp, 

we get x jo = 



E n 

i= 1 



°3o 



Xi + 1 



Xi + 1 



Xi + 1 

L E+1 

i=F3o m 

= i - E"=i 

if jo 



Xj 0 . 



Case 2: When Y is not a singleton, i.e. |Y| > 1 (Y can 
be a partial or total ignorance). Suppose W 2 (Y) = 1, 
with Y = 9j 1 U 9j 2 U ... U 9j p , where all ji, j 2 , ..., 
j p are different and they belong to {1, 2, . . . , n}, 2 < 
p < n. We keep the same notations for m(.||Y) and 
Bayesian 777. 1 ( . ) . The set {ji,j 2 , ■ ■ ■ ,jp} is denoted J 
for notation convenience. Similarly, using PCR5 rule we 
transfer Xj, Vi ^ J, to Xi and to the ignorance Y = 
9j 1 U . . . U 9j p proportionally with respect to x, and 1 
respectively (as done in case 1). So, x, for 1 <f J \s found 



from solving the equation 



Xi + 1 



= ai , which gives 13 Xi = 
,p}- 



(ai+i/af + 4a.j)/2; and Xj r = aj r for r€ (1,2,.. 

Case of Non-Bayesian prior mi(.): 

Unfortunately, when mi(.) is Non-Bayesian, the (PCR5- 
based) deconditioning problem doesn’t admit one unique 
solution in general (see the example 2.1 in the next 
section). But the method used to decondition PCR5 when 
mi (.) is Bayesian can be generalized for 7771 (.) non- 
Bayesian in the following way: 1) We need to know the 
focal elements of 777. 1 ( - ) , then we denote the masses of 
these elements by say xi, X 2 , . . . , x„; 2)Then we combine 
using the conjunctive rule 7771 (.) with 7712 +) = 1, where 
Y can be a singleton or an ignorance; 3) Afterwards, we 
use PCR5 rule and we get some results like: /)(x 1 , ..., x n ) 
for each element, where i = 1,2,.... Since we know 
the results of PCR5 as 7 to(.||Y) = a, for each focal 
element, then we form a system of non-linear equations: 
fi(x i,X2,...,x n ) = ai and we need to solve it. Such 
systems of equations however can admit several solutions. 
We can select a solution satisfying an additional criterion 
like by example the minimum (or the maximum) of 
specificity depending of the kind of Non-Bayesian prior 
we need to use. 



V. Examples 

A. Example 1: Conditioning of a Bayesian prior belief 

Let’s consider 0 = { A , B , C}, Shafer’s model, and the 
prior bba’s mi(.) and m [ (.) given in Table IV and the 
conditional evidence Y A IJ /I. 



13 The solution Xi = (ai — J a? +Aai)/2 must be discarded since it is 
negative and cannot be considered as a mass of belief. 
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Table IV 

Bayesian priors (inputs). 



Focal Elem. 


mj 


m i() 


A 


0.49 


0.01 


B 


0.49 


0.01 


C 


0.02 


0.98 



The significance of having two cases in the Bayesian prior 
case is straighforward. We just want to show that two different 
priors can yield to the same posterior bba with Bayes/SCR rule 
and thus we cannot retrieve these two distinct priors cases from 
the posterior bba. We show that the total deconditioning is 
possible however when using our non-Bayesian conditioning 
rule. SCR and PCR5-based conditioning of mi(.) and m\ (.) 
are given 14 in Table V. One sees that SCR of the two distinct 
bba’s mi(.) and mj (.) yield the same posterior/conditional 
bba m(.\Y) which means that in this very simple Bayesian 
prior case, the deconditioning of m(.\Y) is impossible to 
obtain since at least two solutions 15 for the prior beliefs are 
admissible. The results provided by PCR5-based conditioning 
makes more sense in authors’ point of view since it better takes 
into account the degree of conflicting information in the con- 
ditioning process. One sees that two distinct Bayesian priors 
yield two distinct posterior bba’s with PCR5-based condition- 
ing. If one examines the belief and plausibility functions, one 
gets, using notation A(.|y) = [Bel(.\Y), Pl(.\Y)\, A'(.|Y) = 
[Bel'(.\Y),Pl'(.\Y)], A(.||Y) = [Bel(.\\Y), Pl(.\\Y)} and 
A'U\Y) = [Bel’(.\\Y),Pl'tt\Y)\: 



Table V 

Conditional bb a’ s . 



Focal Elem. 


m(.|Y) 


m'(.|Y) 


m(.||Y) 


m'(.||Y) 


A 


0.5 


0.5 


0.4900 


0.0100 


B 


0.5 


0.5 


0.4900 


0.0100 


C 


0 


0 


0.00039215 


0.48505051 


AU B 


0 


0 


0.01960785 


0.49494949 



Table VI 

Conditional lower and upper bounds of conditional 

PROBABILITIES 





A(.|Y) = A'(.|Y) 


A(.||Y) 


A'(.||Y) 


0 


[0,0] 


[0,0] 


[0,0] 


A 


[0.5, 0.5] 


[0.4900, 0.5096] 


[ 0.0100, 0.5050] 


B 


[0.5, 0.5] 


[0.4900, 0.5096] 


[ 0.0100, 0.5050] 


C 


[0,0] 


[0.0004, 0.0004] 


[0.4850,0.4850] 


Y = AU B 


[1,1] 


[0.9996,0.9996] 


[0.5150,0.5150] 


AU C 


[0.5, 0.5] 


[ 0.4904, 0.5100] 


[0.4950, 0.9900] 


B U C 


[0.5, 0.5] 


[ 0.4904, 0.5100] 


[0.4950, 0.9900] 


A U B U C 


[1,1] 


[1,1] 


[1,1] 



The interval A(.|Y) corresponds to lower and upper bounds 
of conditional subjective probabilities P(.\Y) and A(.||Y) 
corresponds to lower and upper bounds of P(.||Y) (similarly 
for A'(.|Y) and A'(.||Y)). From the Table VI, one sees that 
the property P2 is verified and we get an imprecise conditional 
probability. One sees that contrariwise to SCR (equivalent 
to Bayes rule in this case), one gets Bel{Y\\Y) < 1 and 
also Pl(Y\\Y) < 1. A(.||Y) and A'(.||Y) are very different 
because priors were also very different. This is an appealing 

14 Due to space limitation constraints, the verification is left to the reader. 

15 Actually an infinite number of solutions exists. 



property. If one approximates 16 the conditional probability by 
the mid-value of their lower and upper bounds 17 , one gets 
values given in Table VII. 



Table VII 

Conditional approximate subjective probabilities. 





P(.|Y) = P'(.|Y) 


P(.||Y) 


P'(.||Y) 


0 


0 


0 


0 


A 


0.5 


0.4998 


0.2575 


B 


0.5 


0.4998 


0.2575 


C 


0 


0.0004 


0.4850 


Y = A U B 


1 


0.9996 


0.5150 


AU C 


0.5 


0.5002 


0.7425 


SUC 


0.5 


0.5002 


0.7425 


A U S U C 


1 


1 


1 



When the conditioning hypothesis supports the prior 
belief (as for mi(.) and W 2 (.) which are in low conflict) 
the PCR5-based conditioning reacts as SCR (as Bayes 
rule when dealing with Bayesian priors) and P(X.||Y) is 
very close to P(.|Y). When the prior and the conditional 
evidences are highly conflicting (i.e. like m\ (.) and TO 2 (.), 
PCR5-based conditioning rule is much more prudent than 
Shafer’s rule and that’s why it allows the possibility to have 
P{Y\\Y) < 1. Such property doesn’t violate the fundamental 
axioms (nonnegativity, unity and additivity) of Kolmogorov 
axiomatic theory of probabilities and this can be verified 
easily in our example. In applications, it is much better 
to preserve all available information and to work directly 
with conditional bba’s whenever possible rather than with 
approximate subjective conditional probabilities. 

The deconditioning of the posterior bba’s m{. || Y) given 
in the Table V is done using the principle described in 
section IV (when ?ni(.) is assumed Bayesian and for case 
2). We denote the unknowns mi(A) = xi, mi(B) = X 2 
and rni(C') = X 3 . Since Y = A U B and J = {1,2}, we 
solve the following system of equations (with the constraint 
Xi e [0, 1]): x\ = ci\ = 0.49, X 2 = 02 = 0.49 and 
x\/{x 3 + 1) = «3 = 0.00039215. Therefore, one gets 
after deconditioning m\(A) = 0.49, mi(B) = 0.49 and 
?ni(C) = 0.02. Similarly, the deconditioning of m'{. || Y) 
given in the Table V yields to} (A) = 0.01, m} (P) = 0.01 
and to} (C) = 0.98. 

Note that, contrary wise to Bayes or to Jeffrey’s rules [8], 
[11], [21], it is possible to update the prior opinion about an 
event A even if P (A) = 0 using this Non-Bayesian rule. For 
example, let’s consider 0 = { A , B , C}, Shafer’s model and 
the prior Bayesian mass TOi(A) = 0, m\(B) = 0.3 and m\{C) 
= 0.7, i.e. Bel\(A) = P\(A) = P 1(A) = 0. Assume that the 
conditional evidence is Y = AUB, then one gets with SCR 
m(B\A U B) = 1 and with PCR5-based conditioning m(B || A 
U B) = 0.30, m{A U B || A U B) = 0.41176 and m(C || A 
U B) = 0.28824, which means that P {A\A U 
B) = 0 with SCR/Bayes rule (i.e. no update on A ), whereas 
[Bel(A || A U B), Pl(A || A U B)\ = [0,0.41176], [Bel{B || 

16 When the lower bound is equal to the upper bound, one gets the exact 
probability value. 

17 More sophisticated transformations could be used instead as explained in 
[18], Vol. 3. 
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A U B), Pl(B || AuB)\ = [0.30,0.71176] and [Bel{C || 
A U B), Pl(C || AUB)\ = [0.28823, 0.28823], that is P(A jj 
AU B) € [0,0.41176]. Typically, if one approximates P(. jj 
AU B) by the mid-value of its lower and upper bounds, one 
will obtain P(A |] A U B) = 0.20588 (i.e. a true update of 
the prior probability of A), P(B || A U B) = 0.50588 and 
P(C || A U B) = 0.28824. 

B. Example 2: Conditioning of a Non-Bayesian prior belief 

Example 2.1: Let’s consider now 0 = {A,B,C}, Shafer’s 
model, the conditioning hypothesis Y = A U B and the 

following Non-Bayesian priors: 

Table VIE 

Non-Bayesian priors (inputs). 



Focal Elem. 


mj 


to 


A 


0.20 


0.20 


B 


0.30 


0.30 


C 


0.10 


0.10 


AUB 


0.25 


0.15 


A U B U C 


0.15 


0.25 



The conflict between mi(.) and m^V) = 1 and between 
mi(.) and rri 2 (Y) = 1 is 0.10 in both cases. The results of 
the conditioning are given in Table IX. One sees that when 
distinct priors are Non-Bayesian, it can happen that PCR5- 
based conditioning rule yields also the same posterior bba’s. 
This result shows that in general with Non-Bayesian priors the 
PCR5-based deconditioning cannot provide a unique solution, 
unless extra information and/constraints on the prior belief are 
specified as shown in the next example. 



Table IX 

Conditional bba’s. 



Focal Elem. 


m(.| Y) 


m'(.IY) 


"»(• || Y) 


m'(. || Y) 


A 


0.222 


0.222 


0.20 


0.20 


B 


0.333 


0.333 


0.30 


0.30 


C 


0 


0 


0.01 


0.01 


AUB 


0.445 


0.445 


0.49 


0.49 



Example 2.2 : Let’s consider now 0 = {A, B, C, D}, Shafer’s 
model, the conditional evidence Y = C U D and the posterior 
bba m(. || CUD) given in the right column of the table below: 



Table X 

Conditional bba’s. 



Focal Elem. 


m n (.) 




m(. II A) 


A 


XI 


TtL 


0.0333 


B 


x 2 


Xq 

l + x 2 x 

*3 + ttst + iffe 


0.1667 


CUD 


x 3 


0.8000 



If we assume that the focal elements of the prior bba mi(.) 
are the same as for the posterior bba m(. || CUD), then with 
such extra assumption, the deconditioning problem admits a 
unique solution which is obtained by solving the system of 

x 2 

three equations according to Table X; that is 1 ^ = 0.0333, 

whence x\ « 0.2; = 0.1667, whence X 2 ~ 0.5; X 3 + 

t - p h tt 2 — = 0.8000; whence x\ « 0.3. Therefore, the 

l+xi 1+X2 L 

deconditioning of m(. || C U D) provides the unique Non- 
Bayesian solution mi(A) = 0.2, mi (73) = 0.5 and m\{C U 
D) = 0.3. 



VI. Conclusions 

In this paper, we have proposed a new Non-Bayesian con- 
ditioning rule (denoted || ) based on the Proportional Conflict 
Redistribution (PCR) rule of combination developed in DSmT 
framework. This new conditioning rule offers the advantage to 
take fully into account the level of conflict between the prior 
and the conditional evidences for updating belief functions. It 
is truly Non-Bayesian since it doesn’t satisfy Bayes principle 
because it allows P{X || X ) or Bel(X || X) to be less than 
one. We have also shown that this approach allows to solve 
the deconditioning (dual) problem for the class of Bayesian 
priors. More investigations on the deconditioning problem of 
Non-Bayesian priors need to be done and comparisons of 
this new rule with respect to the main alternatives of Bayes 
rule proposed in the literature (typically Jeffrey’s rule and its 
extensions, Planchet’s rule, etc) will be presented in details in 
a forthcoming publication. 
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Abstract — In this paper, we present an extension of the multi- 
criteria decision making based on the Analytic Hierarchy Process 
(AHP) which incorporates uncertain knowledge matrices for 
generating basic belief assignments (bba’s). The combination of 
priority vectors corresponding to bba’s related to each (sub)- 
criterion is performed using the Proportional Conflict Redistribu- 
tion rule no. 5 proposed in Dezert-Smarandache Theory (DSmT) 
of plausible and paradoxical reasoning. The method presented 
here, called DSmT-AHP, is illustrated on very simple examples. 

Keywords: Analytic Hierarchy Process, AHP, DSmT, In- 
formation Fusion, Decision Making, Multi-Criteria. 



I. Introduction 

The Multi-criteria decision-making (MCDM) problem con- 
cerns the elucidation of the level of preferences of decision 
alternatives through judgments made over a number of criteria 
[6]. At the Decision-maker (DM) level, a useful method for 
solving MCDM problem must take into account opinions 
made under uncertainty and based on distinct criteria with 
different importances. The difficulty of the problem increases 
if we consider a group decision-making (GDM) problem 
involving a panel of decision-makers. Several attempts have 
been proposed in the literature to solve the MCGDM problem. 
Among the interesting solutions developed, one must cite 
the works made by Beynon [3]— [6] . This author developed a 
method called DS/AHP which extended the Analytic Hierar- 
chy Process (AHP) method of Saaty [15]— [17] with Dempster- 
Shafer Theory (DST) [23] of belief functions to take into 
account uncertainty and to manage the conflicts between 
experts opinions within a hierarchical model approach. In this 
paper, we propose to follow Beynon’s approach, but instead 
of using DST, we investigate the possibility to use Dezert- 
Smarandache Theory (DSmT) of plausible and paradoxical 
reasoning developed since 2002 for overcoming DST limita- 



tions 1 [24], This new approach will be referred as DSmT-AHP 
method in the sequel. DSmT allows to manage efficiently the 
fusion of quantitative (or qualitative) uncertain and possibly 
highly conflicting sources of evidences and proposes new 
methods for belief conditioning and deconditioning as well [7]. 
DSmT has been successfully applied in several fields of appli- 
cations (in defense, medicine, satellite surveillance, biometrics, 
image processing, etc). In section II, we briefly introduce the 
principle of the AHP developed by Saaty. In section III, we 
recall the basis of DSmT and its main rule of combination, 
called PCR5 (Proportional Conflict Redistribution rule # 5). 
In section IV, we present the DSmT-AHP method for solving 
the MCDM problem. The extension of DSmT-AHP method 
for solving MCGDM problem is then introduced in section V. 
Conclusions are given in Section VI. 

II. The Analytic Hierarchy Process (AHP) 

The Analytic Hierarchy Process (AHP) is a structured 
technique developed by Saaty in [8], [15], [16] based on 
mathematics and psychology for dealing with complex de- 
cisions. AHP and its refinements are used around the world 
in many decision situations (government, industry, education, 
healthcare, etc.). It helps the DM to find the decision that best 
suits his/her needs and his/her understanding of the problem. 

'A presentation of these limitations with a discussion is done in Chap 1 
of [24], Vol. 3. It is shown clearly that the logical refinement proposed by 
some authors doesn’t bring new insights with respect to what is done when 
working directly on the super-power set (i.e. on the minimal refined frame 
satisfying Shafer’s model). There is no necessity to work with a refined frame 
in DSmT framework which is very attractive in some real-life problems where 
the elements of the refined frame do not have any (physical) sense/meaning 
or are just impossible to clearly determine physically (as a simple example, 
if Mary and Paul have possibly committed a crime alone or together, there 
is no way to refine these two persons into three finer exclusive physical 
elements satisfying Shafer’s model). Aside the possibility to deal with different 
underlying models of the frame, it is worth to note that PCR5 or PCR6 rules 
provide a better ability than the other rules to deal efficiently with highly 
conflicting sources of evidences as shown in all fields of applications where 
they have been tested so far. 
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AHP provides a comprehensive and rational framework for 
structuring a decision problem, for representing and quantify- 
ing its elements, for relating those elements to overall goals, 
and for evaluating alternative solutions. The basic idea of 
AHP is to decompose the decision problem into a hierarchy 
of more easily comprehended sub-problems, each of which 
can be analyzed independently. Once the hierarchy is built, 
the DM evaluates the various elements of the hierarchy by 
comparing them to one another two at a time [21]. In making 
the comparisons, the DM can use both objective information 
about the elements as well as subjective opinions about the 
elements’ relative meaning and importance. The AHP converts 
these evaluations to numerical values that are processed and 
compared over the entire range of the problem. A numerical 
weight or priority is derived for each element of the hierarchy, 
allowing diverse and often incommensurable elements to be 
compared to one another in a rational and consistent way. This 
is the main advantage of AHP with respect to other decision 
making techniques. At its final step, numerical priorities are 
calculated for each of the decision alternatives. These num- 
bers represent the alternatives’ relative ability to achieve the 
decision goal. The AHP method can be summarized as [19]: 

1) Model the problem as a hierarchy containing the decision 
goal, the alternatives for reaching it, and the criteria for 
evaluating the alternatives. 

2) Establish priorities among the elements of the hierarchy by 
making a series of judgments based on pairwise comparisons 
of the elements. 

3) Check the consistency of the judgments and eventually 
revise the comparison matrices by reasking the experts when 
the consistency in judgments is too low. 

4) Synthesize these judgments to yield a set of overall priori- 
ties for the hierarchy. 

5) Come to a final decision based on the results of this process. 
Example 1: According to his/her own preferences and using 
the Saaty’s 1-9 ordinal scale, a DM wants to buy a car among 
four available models belonging to the set 0 = { A , B , C, D}. 
To simplify the example, we assume that the objective of DM 
is to select one of these cars based only on three criteria 
(Cl=Fuel economy, C2=Reliability and C3=Style). According 
to his/her own preferences, the DM ranks the different criteria 
pairwise as follows: 1 - Reliability is 3 times as important as 
fuel economy, 2 - Fuel economy is 4 times as important as 
style, 3 - Reliability is 5 times as important as style, which 
means that the DM thinks that Reliability criteria (C2) is the 
most important criteria, followed by fuel economy (Cl) and 
style is the least important criteria 2 . The relative importance 
of one criterion over another can be expressed using pairwise 
comparison matrix (also called knowledge matrix) as follows: 



fl/1 


1/3 


4/r 




"1.0000 


0.3333 


4.0000" 


3/1 


1/1 


5/1 




3.0000 


1.0000 


5.0000 


_!/4 


1/5 


1/lJ 




0.2500 


0.2000 


1.0000 



where the element m.y of the matrix M indicates the relative 
importance of criteria Ci with respect to the criteria Cj. 

2 The relationships between preferences given by a DM may not be transitive 

as shown in this example, nevertheless one has to deal with these inputs even 

in such situations. 



In this example, m\^ = 4/1 indicates that the criteria Cl 
(Fuel economy) is four times as important as the criteria 
C3 (Style) for the DM, etc. From this pairwise matrix, 
Saaty demonstrated that the ranking of the priorities of the 
criteria can be obtained from the normalized eigenvector 3 , 
denoted w, associated with the principal eigenvalue of the 
matrix, denoted A. In this example, one has A = 3.0857 and 
w = [0.2797 0.6267 0.0936]' which shows that C2 criterion 
(reliability) is the most important criterion with the weight 
0.6267, then the fuel economy criterion Cl is the second most 
important criterion with weight 0.2797, and finally C3 criterion 
(Style) is the least important criterion with weight 0.0936 for 
the DM. A similar ranking procedure can be used to find the 
relative weights of each car A, B, C or D with respect to 
each criterion Cl, C2 and C3 based on given DM preferences, 
hence one will get three new normalized eigenvectors denoted 
w(CT), w(C2) and w(C3). By example, if one has the 
following normalized vectors 



[w(Cl) w(C2) w(C3)] 



0.2500 0.4733 0.1129 

0.1304 0.0611 0.4435 

0.5109 0.1832 0.0565 

0.1087 0.2824 0.3871 



then the solution of the MCDM problem (here the selec- 
tion of the ’’best” car according to the DM multicriteria 
preferences) is finally obtained by multiplying the matrix 
[w(Cl) w(C2) w(C3] by the criteria ranking vector w. For 
this example, one will get: 



"0.2500 


0.4733 


0.1129" 




0.2797 




"0.3771" 


0.1304 


0.0611 


0.4435 








0.1163 


0.5109 


0.1832 


0.0565 


X 


0.6267 


— 


0.2630 


0.1087 


0.2824 


0.3871 




0.0936 




0.2436 



Based on this result, the car A which has the most important 
weight (0.3771) will be selected by the DM. The costs could 
also be included in AHP by taking into account the benefit 
to cost ratios which will allow to chose alternative with 
lowest cost and highest benefit. For example, let’s suppose 
that the cost of car A is 21000 euros, the cost of car B is 
13000 euros, the cost of car C is 12000 euros and the cost 
of car D is 18000 euros, then the normalized cost vector 
is [0.3281 0.2031 0.1875 0.2812]', so that the benefit-cost 
ratios are now [0.3771/0.3281 = 1.1492 0.1163/0.2031 = 
0.5724 0.2630/0.1875 = 1.4026 0.2436/0.2812 = 0.8663]'. 
Taking into account now the cost of vehicles, now the best 
solution for the DM is to choose the car C since it offers the 
highest benefit-cost ratio. 



In this paper we do not focus on the rank reversal problem of 
AHP as discussed in [9], [10], [13], [18], [22], but we propose 
an extension of AHP using aggregation method developed 
in DSmT framework, able to make a difference between 
importance of criteria, uncertainty related to the evaluations 
of criteria and reliability of the different sources. 



4 Note that if the relationships on the criteria is transitive, then we can 
easily construct the normalized vector of priorities from a system of algebraic 
equations, without employing Saaty’s matrix approach. For example if in the 
previous example one assumes 4 M23 = 12/1 and M32 = 1/12 instead of 
5/1 and 1/5, then the normalized weighting vector will be directly obtained 
as w = [4/17 12/17 1/17]'. 
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III. Basics of DSmT 

Let 0 = {61,62,- ■■ ,6 n } be a finite set of n elements 
assumed to be exhaustive. 0 corresponds to the frame of 
discernment of the problem under consideration. In general, 
we assume that elements of 0 are non exclusive in order to 
deal with vague/fuzzy and relative concepts [24], Vol. 2. This 
is the so-called free-DSm model. In DSmT, there is no need 
to work on a refined frame consisting in a discrete finite set 
of exclusive and exhaustive hypotheses 5 because DSm rules 
of combination work for any models of the frame. The hyper- 
power set Z) e is defined as the set of all propositions built from 
elements of 0 with U and n, see [24], Vol. 1 for examples. 
A (quantitative) basic belief assignment (bba) expressing the 
belief committed to the elements of D e by a given source 
is a mapping m(-): D e — » [0, 1] such that : m(0) = 0 and 
Sapd 6 m {A) = 1- Elements A £ D e having m(A) > 0 are 
called focal elements of m(.). The credibility and plausibility 
functions are defined in almost 6 the same manner as in DST 
[23], In DSmT, the Proportional Conflict Redistribution Rule 
no. 5 (PCR5) is used generally to combine bba’s. PCR5 
transfers the conflicting mass only to the elements involved 
in the conflict and proportionally to their individual masses, 
so that the specificity of the information is entirely preserved 
in this fusion process. For example: consider two bba’s mi(.) 
and TO 2 (.), AAB = 0 for the model of 0, and mi (A) = 0.6 
and m 2 (B) = 0.3. With PCR5 the partial conflicting mass 
m\(A)m 2 {B) = 0.6 • 0.3 = 0.18 is redistributed to A and 
B only with respect to the following proportions respectively: 
xa = 0.12 and xb = 0.06 because 

xa _ x B _ m 1 (A)m 2 (B ) _ 0.18 _ 9 
mi (A) m. 2 {B) mi{A) + m 2 {B) 0.9 

In this paper, we work in the power set 2 e since most of read- 
ers are usually already familiar with this fusion space. Let’s 
mi(.) and m 2 (.) be two independent 7 bba’s, then the PCR5 
rule is defined as follows (see [24], Vol. 2 for full justification 
and examples): mpcRb{$) = 0 and VX £ 2 e \ {0} 

mp C R 5 (X) = ^2 mi(X 1 )m 2 (X 2 )+ 

X 1 ,X 2 £2 0 

x 1 nx 2 =x 

-y ?n 1 (X) 2 m 2 (X 2 ) m 2 (X) 2 m 1 (X 2 ) . 

m 1 (X) + m 2 (X 2 ) m 2 (X) + m 1 (X 2 ) 

X 2 £2 
x 2 nx=<fi 

where all denominators in (1) are different from zero. If a 
denominator is zero, that fraction is discarded. All proposi- 
tions/sets are in a canonical form. A variant of (1), called 
PCR6, for combining s > 2 sources and for working in 
other fusion spaces (hyper-power sets or super power-sets) is 
presented in [24]. Additional properties of PCR5 can be found 
in [7], Extension of PCR5 for combining qualitative bba’s can 
be found in [24], Vol. 2 & 3. 

deferred as Shafer’s model in the literature. 

6 We just replace 2® by D® in the definitions of credibility and plausibility 
functions. 

7 i.e. each source provides its bba independently of the other sources. 



IV. DSmT-AHP for SOLVING MCDM 

DSmT-AHP aimed to perform a similar purpose as AHP 
[15], [16], SMART [28] or DS/AHP [2], [4], etc. that is to find 
the preferences rankings of the decision alternatives (DA), or 
groups of DA. DSmT-AHP approach consists in three steps: 

• Step 1: We extend the construction of the matrix for 
taking into account the partial uncertainty (disjunctions) 
between possible alternatives. If no comparison is avail- 
able between elements, then the corresponding elements 
in the matrix is zero. Each bba related to each (sub-) 
criterion is the normalized eigenvector associated with the 
largest eigenvalue of the ’’uncertain” knowledge matrix 
(as done in standard AHP approach). 

• Step 2: We use the DSmT fusion rules, typically the 
PCR5 rule, to combine bba’s drawn from step 1 to get a 
final MCDM priority ranking. This fusion step must take 
into account the different importances (if any) of criteria 
as it will be explained in the sequel. 

• Step 3: Decision-making can be done based either on the 
maximum of belief, or on the maximum of the plausibility 
of Decision alternatives (DA), as well as on the maximum 
of the approximate subjective probability of DA obtained 
by different probabilistic transformations. 

Example 2: Let’s consider now a set of three cars 0 = 
{A, B, C} and the criteria Cl=Fuel Economy, C2=Reliability. 
Let’s assume that with respect to each criterion the following 
’’uncertain” knowledge matrices are given: 

M(C'l) = 

A B AU C BUG 
, A I 2 I 3 

M(C2) = B 1/2 1 1/2 1/5 

AUG 1/4 21 0 

_BUG 1/3 5 0 1 

Step 1: (bba’s generation) Applying AHP method, one gets the 
following priority vectors w(Cl) « [0.0889 0.5337 0.3774]' 
and w(C2) « [0.5002 0.1208 0.1222 0.2568]' which are 
identified with the bba’s mci(-) and rricii-) as follows: 
met (A) = 0.0889, mci(B l J C) = 0.5337, mci(A UBU 
C) = 0.3774 and m C2 (A) = 0.5002, m C i{B ) = 0.1208, 
m C i{A U C)= 0.1222 and m C i{B U C) = 0.2568. 

Step 2: (Fusion) When the two criteria have the same full 
importance in the hierarchy they are fused with one of the 
classical fusion rules. In [4] Beynon proposed to use Demp- 
ster’s rule. Here we propose to use the PCR5 fusion rule since 
it is known to have a better ability to deal efficiently with 
possibly highly conflicting sources of evidences [24], Vol. 2. 
With PCR5, one gets: 



Elem. of 2® 


m G1 (■) 


m G2 (■) 


”PGRS(-) 


0 


0 


0 


0 


A 


0.0889 


0.5002 


0.3837 


B 


0 


0 


0.1162 


AU B 


0 


0.1208 


0 


C 


0 


0 


0.0652 


A DC 


0 


0.1222 


0.0461 


B U C 


0.5337 


0.2568 


0.3887 


A U B U C 


0.3774 


0 


0 



Step 3: (Decision-making) A final decision based on 
mpcRa(-) must be taken. Usually, the decision-maker (DM) 
is concerned with a single choice among the elements of 0. 
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Many decision-making approaches are possible depending on 
the risk the DM is ready to take. A pessimistic DM will 
choose the singleton of 0 giving the maximum of credibility 
whereas an optimistic DM will choose the element having the 
maximum of plausibility. A fair attitude consists usually in 
choosing the maximum of approximate subjective probability 
of elements of 0. The result however is very dependent on 
the probabilistic transformation (Pignistic, DSmP, Sudano’s, 
etc) [24], Vol. 2. Below are the values of the credibility, the 
pignistic probability and the plausibility of A, B and C: 



Elem. of © 




BetP(.) 


PH- ) 


A 


0.3837 


0.4068 


0.4298 


B 


0.1162 


0.3105 


0.5049 


C 


0.0652 


0.2826 


0.5000 



The car A will be preferred with the pessimistic or pignistic 
attitudes, whereas the car B will be preferred if an optimistic 
attitude is adopted since one has Pl(B) > Pl(C) > Pl{A). 

The MCDM problem deals with several criteria having 
different importances and the classical fusion rules cannot 
be applied directly as in step 2. In AHP, the fusion is done 
from the product of the bba’s matrix with the weighting 
vector of criteria. Such AHP fusion is nothing but a simple 
componentwise weighted average of bba’s and it doesn’t 
actually process efficiently the conflicting information between 
the sources. It doesn’t preserve the neutrality of a full ignorant 
source in the fusion. To palliate these problems, we propose 
a solution for combining sources of different importances in 
the framework of DSmT and DST. 

Before going further, it is essential to explain the difference 
between the importance and the reliability of a source of 
evidence. The reliability is an objective property of a source, 
whereas the importance of a source is a subjective character- 
istic expressed by the fusion system designer. The reliability 
of a source represents its ability to provide the correct as- 
sessment/solution of the given problem. It is characterized by 
a discounting reliability factor, usually denoted a in [0,1], 
which should be estimated from statistics when available, 
or by other techniques [11], The reliability can be context- 
dependent. By convention, we usually take a = 1 when the 
source is fully reliable and a = 0 if the source is totally 
unreliable. The reliability of a source is usually taken into 
account with Shafer’s discounting method [23] defined by: 

m a {X) = a-m(X), for X A © 

m a (Q) = a • m(O) + (1 — a) 

The importance of a source is not the same as its reliability 
and it can be characterized by an importance factor, denoted (3 
in [0, 1] which represents somehow the weight of importance 
granted to the source by the fusion system designer. The choice 
of P is usually not related with the reliability of the source 
and can be chosen to any value in [0, 1] by the designer 
for his/her own reason. By convention, the fusion system 
designer will take (3 = 1 when he/she wants to grant the 
maximal importance of the source in the fusion process, and 
will take /3 = 0 if no importance at all is granted to this 
source in the fusion process. The fusion designer must be able 
to deal with importance factors in a different way than with 



reliability factors since they correspond to distinct properties 
associated with a source of information. The importance of 
a source is particularly crucial in hierarchical multi-criteria 
decision making problems, specially in the AHP [16], [20]. 
That’s why it is primordial to show how the importance can 
be efficiently managed in evidential reasoning approaches. 
The main question we are concerned here is how to deal 
with different importances of sources in the fusion process in 
such a way that a clear distinction is made/preserved between 
reliability and importance? Our preliminary investigations for 
the search of the solution of this problem were based on the 
self/auto-combination of the sources. But such approach is 
very disputable and cannot be used satisfactorily in practice 
whatever the fusion rule is adopted because it can be easily 
shown that the auto-conflict tends quickly to 1 after several 
auto-fusions [11], Actually a better approach can be used for 
taking into account the importances of the sources and can 
be considered as the dual of Shafer’s discounting approach 
for reliabilities of sources. The idea was originally introduced 
briefly by Tacnet in [24], Vol. 3, Chap. 23, p. 613. It consists 
to define the importance discounting with respect to the 
empty set rather than the total ignorance 0 (as done with 
Shafer’s discounting). Such new discounting deals easily with 
sources of different importances and is very simple to use. 
Mathematically, we define the importance discounting of a 
source m(.) having the importance factor /3 in [0, 1] by: 

jmp(X)=p-m(X), forX^0 
[^(0) = (3 ■ m(0) + (1 — ( 3 ) 

Here we allow to deal with non-normal bba since m^(0) > 0 
as suggested by Smets in [26], This new discounting pre- 
serves the specificity of the primary information since all 
focal elements are discounted with same importance factor. 
Here we use the positive mass of the empty set as an 
intermediate/preliminary step of the fusion process. Clearly 
when (3 = 1 is chosen by the fusion designer, it will mean 
that the source must take its full importance in the fusion 
process and so the original bba m(.) is kept unchanged. 
If the fusion designer takes /3 = 0, one will deal with 
TO /?W = 1 which is interpreted as a fully non important 
source. m(0) > 0 is not interpreted as the mass committed 
to some conflicting information (classical interpretation), nor 
as the mass committed to unknown elements when working 
with the open-world assumption (Smets interpretation), but 
only as the mass of the discounted importance of a source in 
this particular context. Based on this discounting, one adapts 
PCR5 (or PCR6) rule for N > 2 discounted bba’s 
i = 1, 2, . . . N by considering the following extension, denoted 
PCR5 0 , defined by: MX <= 2 e 

mpcfl5 8 (^)= ^2, m\{Xi)m.2{X2)+ 

X 1 ,X 2 €2< S 

x 1 nx 2 =x 

r m i(X) 2 m 2 (X 2 ) v'i 2 {X) 2 ini{X 2 ) , 

. e mi(X) + m 2 (X 2 ) m 2 (X)+m 1 (X 2 ) 

A' 2 £2 

A' 2 nA'=0 
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A similar extension can be done for PCR5 and PCR6 formulas 
for N > 2 sources given in [24], Vol. 2. A detailed presenta- 
tion of this technique with several examples will appear in [25] 
and thus it is not reported here. The difference between eqs. 
(1) and (4) is that mpcR 5 ($) = 0 whereas mpcfl5 0 (0) > 0. 
Since we usually work with normal bba’s for decision making 
support, the combined bba will be normalized. In the AHP 
context, the importance factors correspond to the components 
of the normalized eigenvector w. 

Example 3: Take back example 2 assume that C2 (the relia- 
bility) is three times more important than Cl (fuel economy) 
so that the knowledge matrix is given by: 

TV ft [1/1 1/3] ^ [1.0000 0.3333] 

1V1 [3/1 1 /lJ ~ [3.0000 l.OOOo] 

Its normalized principal eigenvector is w = [0.2500 0.7500]' 
and indicates that C2 is three times more important than Cl 
as expressed in the prior DM preferences for ranking criteria, 
w = [wi W 2 }' can also be obtained directly by solving the 
algebraic system of equations W 2 = 3w± and w\ + W 2 = 1 
with Wi, W 2 G [0, 1]. If we apply the importance discounting 
with j3\ = Wi = 0.25 and P 2 = u >2 = 0.75, one gets the 
following discounted bba’s 



Elem. of 2 @ 
0 
A 
B 

AU B 
C 

AU C 
B U C 
A U B U C 



,Cl(-) 

0.7500 

0.0222 

0 

0 

0 

0 

0.1334 

0.0944 



m /3 9 ,C2(') 

0.2500 

0.3751 

0 

0.0906 

0 

0.0917 

0.1926 

0 



With the PCR50 fusion of the sources mp lt c i(.) and 
m/ 3 2 ,C 2 (•), one gets the results in the table. For decision- 
making support, one prefers to work with normal bba’s. 
Therefore mpc_R 5 0 (.) is normalized by redistributing back 
to pc R 5 0 (0) proportionally to the masses of other focal el- 
ements as shown in the right column of the next table. 



Elem. of 2 <=) 


m PCR5 0 (•) 


normalized / \ 
m PCi?5 fl 


0 


0.6558 


0 


A 


0.1794 


0.5213 


B 


0.0121 


0.0351 


AU B 


0.0159 


0.0461 


C 


0.0122 


0.0355 


AU C 


0.0161 


0.0469 


BUC 


0.1020 


0.2963 


A U B U C 


0.0065 


0.0188 



If all sources have the same full importances (i.e. all /3,= 1), 
then mpcp 5 0 (.) = mpcRb{-) which is normal because in 
such case = ma(-)- From mp™® z # etl (.) one 

can easily compute the credibility, pignistic probability or 
plausibility of each element of 0 for decision-making. In this 
example one gets: 



Elem. of © 


Bel(.) 


BetP(.) 


P K0 


A 


0.5213 


0.5741 


0.6331 


B 


0.0351 


0.2126 


0.3963 


C 


0.0355 


0.2134 


0.3974 



If the classical AHP ’’fusion” method (i.e. weighted arithmetic 
mean) is used directly with bba’s me i(.) and mc 2 {-), one 
gets: 



mAHp ( •) 



0 0 

0.0889 0.5002 

0 0 

0 0.1208 [0.25] 

0 0 X |_0.75j 

0 0.1222 

0.5337 0.2568 

0.3774 0 



0 

0.3974 

0 

0.0906 

0 

0.0917 

0.3260 

0.0944 



which would have provided the following result for decision- 
making 



Elem. of © 


Bel(.) 


SetP(.) 


PIC-) 


A 


0.3974 


0.5200 


0.6741 


B 


0 


0.2398 


0.5110 


C 


0 


0.2403 


0.5121 



In this very simple example, one sees that the importance 
discounting technique coupled with PCR5-based fusion rule 
(what we call the DSmT-AHP approach) will suggest, as with 
classical AHP, to choose the alternative A since the car A has 
a bigger credibility (as well as a bigger pignistic probability 
and plausibility) than cars B or C . It is however worth to 
note that the values of Bel{.), BetP(.) and Pl(.) obtained by 
both methods are slightly different. The difference in results 
can have a strong impact in practice in the final result for 
example if the costs of vehicles have also to be included in 
the final decision (as explained at the end of the example 1). 
Note also that the uncertainties U(X) = Pl{X ) — Bel(X ) 
of alternatives X = A, B,C have been seriously diminished 
when using DSmT-AHP with respect to what we obtain with 
classical AHP as seen in the following table. The uncertainty 
reduction is a nice expected property specially important for 
decision-making support. 



Elem. of © 


U(.) with AHP 


[/(.) with DSmT-AHP 


A 


0.2767 


0.1118 


B 


0.5110 


0.3612 


C 


0.5121 


0.3619 



Important remark: If Dempster’s rule is used instead of 
PCR5i) rule, one gets the following results when compar- 
ing the fusion of mci(-) with mc 2 (-) (i.e. without im- 
portance discounting) with the fusion of m | g 1 - u , 1= o. 25 ,ci(-) 
with m 0 2 - W 2 -o. 75 ,C 2 (-) (i.e. with importance discounting of 
criteria Cl and C2): 



Elem. of 2 <=> 


m DS(') 


m DS.w(') 
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0 


0 


A 


0.3588 


0.3588 


B 


0.0908 


0.0908 


AU B 


0.0642 


0.0642 


C 


0.0918 


0.0918 


AU C 


0.0649 


0.0650 


BUC 


0.3294 


0.3294 


A U B U C 


0 


0 



Clearly, Dempster’s rule cannot deal properly with impor- 
tance discounted bba’s as we have proposed in this work just 
because the importance discounting technique preserves the 
specificity of the primary information and thus Dempster’s 
rule does not make a difference in results when combining 
either me i(.) with mc 2 (•) or when combining m J g 1 ^i,ci(.) 
with mp 27 n t c 2 {-) due to the way of processing of the total 
conflicting mass of belief. PRC5 deals more efficiently with 
importance discounted bba’s as we have shown in this exam- 
ple. So it is not surprising that such discounting technique 
has never been proposed and used in DST framework and this 
explains why only the classical Shafer’s discounting technique 
(the reliability discounting) is generally adopted. By using 
Dempster’s rule, the fusion designer has no other choice 
but to consider importance and reliability as same notions ! 
The DSmT framework with PCR5 (or PCR6) rule and the 
importance discounting technique proposed here provides an 
interesting and simple solution for the fusion of sources with 
different importances which makes a clear distinction between 
importances and reliabilities of sources. 
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V. DSmT-AHP for SOLVING MCGDM 

Previously, a new approach mixing AHP with DSmT solv- 
ing MCDM problem has been presented. In many practical 
situations however, the decision must be taken by a group 
of n > 1 Decision Makers (GDM), denoted GDM = 
{DMi,i = 1,2, ... ,n}, rather than a single DM, and from 
the Multi-Criteria preference rankings of the PMj’s. The 
importance (influence) of each member of the GDM is usually 
non-equivalent [1] and the importance of each DM of the 
GDM must be efficiently taken into account in the final 
decision-making process. Let’s denote by rriDMii-) the re- 
sult of DSmT-AHP approach (see section IV) related with 
DMi G GDM. The MCGDM problem consists in combining 
all opinions/preferences rankings mDMi(-), i = 1 ,...,n 
with their own (possibly different) importances. When all 
DMi s have equal importance, the classical fusion rules 8 © 
for combining moM, (■) can be directly used to get the final 
result tomcgdm(-) = [maMi ©iadm 2 © • • • ©todmJ(-); If 
the D Mj’s have different importance weights w , , the DSmT- 
AHP approach can also be used at the GDM fusion level 
using the importance discounting approach presented here. The 
result for group decision-making is given by the PCR50 fusion 
of mp i . o Mi ( ■ ) , with /3i = w * and then the result must be 
normalized for decision making support. In [6], Beynon used 
the classical discounting technique [23] to readjust TOdm, (•) 
with Wi s and he identified the importance factors with the 
reliability factors. In our opinions, this is disputable since 
importance of a DMi is not necessarily related with its 
reliability but rather with the importance in the problem of the 
choice of his/her Multi-Criteria to establish his/her ranking, or 
it can come from other (political, hierarchical, etc.) reasons. 
In our new approach, we make a clear distinction between 
notions of importance and reliability and both notions can be 
easily taken into account [25] with DSmT-AHP for solving 
MCGDM problems, i.e. we can use the classical discounting 
technique for taking into the reliabilities of the sources, and 
use the importance discounting proposed here for dealing with 
the importances of sources. 

VI. Conclusions and perspectives 

In this paper, we have presented a new method for Multi- 
Criteria Decision-Making (MCDM) and Multi-Criteria Group 
Decision-Making (MCGDM) based on the combination of 
AHP method developed by Saaty and DSmT. The AHP 
method allows to build bba’s from DM preferences of solutions 
which are established with respect to several criteria. The 
DSmT allows to aggregate efficiently the (possibly highly 
conflicting) bba’s based on each criterion. This DSmT-AHP 
method allows to take into account also the different impor- 
tances of the criteria and/or of the different members of the 
decision-makers group. The application of this DSmT-AHP 
approach for the prevention of natural hazards in mountains is 
currently under progress, see [24], Vol.3, Chap. 23, and [27]. 

8 typically the PCR5 or PCR6 rules, or eventually Dempster’s rule if the 
conflict between DMf s is low. 
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Abstract - In theories of evidence, several methods have been 
proposed to combine a group of basic belief assignments al- 
together at a given time. However, in some applications in 
defense or in robotics the evidences from different sources are 
acquired only sequentially and must be processed in real-time 
and the combination result needs to be updated the most recent 
information. An approach for combining sequentially unreliable 
sources of evidence is presented in this paper. The sources of 
evidence are not considered as equi-reliable in the combination 
process, and no prior knowledge on their reliability is required. 
The reliability of each source is evaluated on the fly by a distance 
measure, which characterizes the variation between one source 
of evidence with respect to the others. If the source is considered 
as unreliable, then its evidence is discounted before entering in 
the fusion process. Dempster’s rule of combination and its main 
alternatives including Yager’s rule, Dubois and Prade rule, and 
PCR5 are adapted to work under different conditions. In this 
paper, we propose to select the most adapted combination rule 
according to the value of conflicting belief before combining the 
evidence. The last part of this paper is devoted to a numerical 
example to illustrate the interest of this approach. 

Keywords: evidence theory, combination rule, evidence 
distance, conflicting belief. 

I. Introduction 

Evidence theories 1 are widely applied in the field of infor- 
mation fusion. A particular attention has been focused on how 
to efficiently combine sources of evidence altogether at the 
same time (static approach), and many rules aside Dempster’s 
rule have been proposed [1], [2], [ 6 ], [9]. In many applications 
however, the evidences from different sources are acquired 
sequentially by different sensors or human experts and the 
belief updating and decision-making need to be taken in real- 
time which requires a sequential/dynamic approach rather than 
a static approach of the fusion problem. 

Usually the evidences arising from different independent 
sources are often considered equally reliable in the combina- 
tion process, when the prior knowledge about the reliability 
of each source is unknown. However, all the sources of 
evidence to be combined can have different reliabilities in real 



applications. If the sources of evidence are considered as equi- 
reliable, the unreliable ones may bring a very bad influence 
on combination result, and even leads to inconsistent results 
and wrong decisions. Thus, the reliability of each source must 
be taken account in the fusion process as best as possible 
to provide a useful and unbiased result. In this work, we 
propose to evaluate on the fly the relability of the sources to 
combine based on an evidential distance/reliability measure. 
From this reliability measure, one can discount accordingly 
the unreliable sources before applying a rule of combination 
of basic belief assignments (bba’s). 

Many rules, like Dempster’s rule [7] and its alternatives can 
be used to combine sources of evidences expressed by bba’s 
and they all have their drawbacks and advantages (see [ 8 ], 
Vol. 1, for a detailed presentation). Dempster’s rule, is usually 
considered well adapted for combining the evidences in low 
conflict situations and it requires acceptable complexity when 
the dimension of the frame of discernment is not too large. 
Dempster’s rule however provides counter-intuitive behaviors 
when the sources evidences become highly conflicting. To 
palliate this drawback, several interesting alternatives have 
been proposed when Dempster’s rule doesn’t work well, 
mainly: Yager’s rule [9], Dubois and Prade rule (DP rule) 
[2], and PCR5 (proportional conflict redistribution rule no 5) 
[ 8 ] developed in DSmT framework. The difference among 
Dempster’s rule and its main alternatives mainly lies in the 
distribution of the conflicting belief m®($) which is generally 
used to characterize the total amount of conflict [4] between 
sources. In this paper, we propose to select the proper rule 
of combination based on the value of the total degree of 
conflict m ,0 (0) . The last part of this paper presents a numerical 
example to show how the approach of sequential adaptive 
combination of unreliable sources of evidence works. 

II. Preliminaries 

A. Basics of Dempster-Shafer theory (DST) 

DST [7] is developed in Shafer’s model. In this model, 
a fixed set 0 = { 6 \, 62 , ■ ■ ■ , 0 n } is called the frame of 
discernment of fusion problem. All the elements in 0 are 
mutually exclusive and exhaustive. The set of all subsets of 



DST (Dempster-Shafer Theory) [7] or DSmT (Dezert-Smarandache Theory) [8], 
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0 is called the power set of 0, and it is denoted 2 e . For 
instance, if 0 = {di,8 2 ,9^}, then 2 e = {0, 9\, 0 2 , # 3 , &i U 
0-2, 9 1 U 6 * 3 , 02 U 6 * 3 , 9 1 U 0 2 U 03 }. A basic belief assignment 
(bba), also called mass of belief, is a mapping to : 2 e — > [0, 1] 
associated to a given body of evidence B such that m(0) = 0 
and X^ 4 G 2 em (^) = 1 - The credibility (also called belief) 
of A C 0 is defined by Bel(A) = Es£ 2 e m (B). The 

BCA 

commonality function q(.) and the plausibility function Pl(.) 
are also defined by Shafer in [7], The functions m(.), Bel(.), 
q(.) and Pl(.) are in one-to-one correspondence. 

Let TOi(.) and m 2 (.) be two bba’s provided by two in- 
dependent bodies of evidence Bi and B 2 over the frame of 
discernment 0. The fusion/combination of mi(.) with )n 2 (.), 
denoted m(.) = [mi 0 m 2 ](.) is obtained in DST with 
Dempster’s rule of combination as follows: 



this principle, DP rule is defined by m(0) = 0 and for 
A ^ 0 and A G 2 e by 

m(A) = ^ toi(X)to 2 (F) 

X.Ye 2 e 
xnY=A 

+ ^2 m 1 (X)m 2 (Y) (4) 

A,Ye 2 e 

xnY=<D 

XU Y=A 

• PCR5 rule: PCR5 transfers the partial conflicting mass 
to the elements involved in the conflict, and it is consid- 
ered as the most mathematically exact redistribution of 
conflicting mass to nonempty sets following the logic of 
the conjunctive rule. PCR5 is defined by m(0) = 0 and 
VA ^ 0, A G 2 e by 



m ( 



= 0 



'mi(Xi)rri2(X2) /i\ 

m{A) = Xin ^ A miiXl)m2i x 2) VA^ 0 ,A G 2 ° 

x 1 nx 2 ^0 



The degree of conflict between the bodies of evidence B\ 
and B> is defined by 

m®(0) = ^2 m 1 (X 1 )m 2 (X 2 ) ( 2 ) 

A'i,A 2 e 2 e 

AiHA 2 =0 



Dempster’s rule can be directly extended to the combination 
of S independent and equally reliable sources. It is a commu- 
tative and associative rule of combination and it preserves the 
neutral impact of the vacuous belief assignment defined by 

m vba (Q) = 1 . 



m(A) = *22 m 1 (X 1 )m 2 (X 2 )+ 

Ai,A 2 e 2 e 

x 1 nx 2 =A 

y- mi{A) 2 m 2 {X 2 ) to 2 (A) 2 toi(X 2 ) 
rni(A) + m 2 (X 2 ) m 2 (A) + m 1 (X 2 ) 



The details, examples and the extension of PCR5 formula 
(5) for S' > 2 sources are given in [ 8 ], 

C. Discounting source of evidence 

When the sources of evidences are not considered equally 
reliable, it is reasonable to discount each unreliable source s*, 
i = 1, 2, . . . , S by a reliability factor on G [0, 1], Following 
the classical discounting method [7], a new discounted bba 
m'(.) is obtained from the initial bba m(.) provided by the 
unreliable source .s, as follows 



B. Main alternatives to Dempster’s rule 

Dempster’s rule yields counterintuitive results when the 
evidences highly conflict because of its way of assigning the 
mass of conflicting belief m®(0). Thus, a lot of alternatives 
to Dempster’s rule have been proposed for overcoming limita- 
tions of Dempster’s rule. The main alternative rules including 
Yager’s rule [9], DP rule [2] and PCR5 [ 8 ] are briefly recalled. 

• Yager’s rule: Yager admits the conflicting belief is not 
reliable. So m®(0) is transferred to the total ignorance 
in Yager’s rule. It is given by m(0) = 0 and for 4^0, 
A e 2 e by 

tn{A) = x,re 2 e m i(X) m 2 (Y), for A ^ 0 

API Y=A 

m(G) = m 1 (0)m 2 (0) + Ylx,ve 2 e m 1 (X)m 2 (Y) 

Anr=0 

(3) 

• Dubois & Prade rule: This rule assumes that if two 
sources of evidence are in conflict, one of them is right 
but we don’t know which one. Thus, if X n Y = 0, then 
the mass committed to the set X n Y by the conjunctive 
operator should be transferred to X U Y. According to 



m' (A) = <%i ■ to(A), A / 0 

to'( 0) = 1 ^ EA 62 e m'(A) < 6) 

A^e 

cii = 1 means that the total confidence in the source s$, and the 
original bba doesn’t need to be discounted, a* = 0 means that 
the source is s t is totally unreliable and its bba is revised as a 
vacouous bba m'( 0 ) = 1 , which will have a neutral impact in 
the fusion process. In practice, the discounting method can be 
used efficiently if one has a good estimation of the reliability 
factor of each source. We show in the next section how one 
can evaluate the relability of a source. 

III. Evaluating the reliability of each source 

Without prior knowledge on the reliability of the sources 
of evidence, we propose to evaluate the reliability factors of 
each source based on the distance between the bba from a 
given source s, with respect to the others. If the bba of the 
given source, say s* varies too much with respect to the others, 
this source of evidence is considered not reliable and it will 
be discounted before to be combined. We will show further 
how the discounting/reliability factor can be estimated. We 
implicitly assume here that the following principle ’’Truth is 
reflected by the majority of opinions” holds. 



24 




Advances and Applications of DSmT for Information Fusion. Collected Works. Volume 4 



In [3], Jousselme et al. have proposed the following distance 
measure dj(mi,m2) between two bba’s 2 mi = mi(.) and 
m 2 = m2(.) defined on the same power set 2 e : 

dj(m 1 ,m 2 ) = ^(mi - m 2 ) r D(mi - m 2 ) (7) 

where D is a 2l 0 l x 2> e l positive matrix whose elements are 
defined as Dij = where A* and Bj are elements of 

the power set 2 e . dj(mi,m 2 ) £ [0,1] is a distance which 
measures the similarity between rri | and m 2 considering both 
the values and the relative specificity of focal elements of each 
bba. 

The total degree of conflict 7719 (0) obtained from all focal 
elements which are incompatible doesn’t actually capture the 
similarity between bba’s as shown by Martin et al. in [5]. 

If N pieces of evidence mi, m2, ..., mjv are combined 
sequentially, two approaches similarly with [5] could be used 
to measure the variation between m y and the others. One 
considers the average value dj between m, and the others 
which is given by 

1 1 

= 7— r ^2 ( g ) 

1 i= 1 

The other one is simply defined as 

d 2 _1 ( m f) = mj _1 ) (9) 

where = m{~ 1 (.) is obtained by the sequential 

combinination of the bba’s mi(.), m 2 (. ),..., wij- 1(-), i.e. 
mj _1 (.) = (((mi ® m 2 ) ® m 3 ) • • • ® Wj_i)(.) with a fusion 
rule such as Dempster’s rule. Yager’s rule, DP rule, PCR5, etc. 
The second measure, (m.,), reflects only the difference 

between m j and the combined bba m{ ~~ 1 and thus cannot 
precisely measure the similarity between rrij and the other 
individual evidences mi(.), m2(.), ..., rrij- 1(.) because some 
information on specificities of these individual bba’s has been 
lost forever through the fusion process. The following exam- 
ples will show the distinction between these two methods. 
Example 1: Let’s consider the frame of discernment 0 = 
{A, B, C } , Shafer’s model and the same following bba’s 

Wi(.) : mi (A) = 0.5,mi(i3) = 0.2 

mi(A U B) = mi(C) = ?ni(0) = 0.1 

m 2 (.) : m 2 (A) = 0.5, m 2 {B) = 0.2 

m 2 (A Li 1 3) = m 2 (C) = ?n 2 (0) = 0.1 

rrij(.) : rrij(A) = 0.5 ,rrij(B) = 0.2 

rrij{A U B) = m,j(C) = mj(Q) = 0.1 

The difference between rrij{.), for j > 2, and all the bba’s 
nrii(.), for i < j according to formula (8) gives = 0, 

which shows correctly that rrij (.) is identical to the other bba’s 



for i < j. If one uses the measure d J 2 1 (m y ) defined 
in (9), one gets the results plotted in Fig. 1. 




Fig. 1: Variation of the similarity measure d J 2 1 (m J ) based on 
different fusion rules. 

One sees that there exists a variation of the similarity 
measure using all different fusion rules with a trend to certain 
values when j increases. This is a bad behavior since we know 
that my equals to the others bba’s and we would expect to get 
d^ _1 (nij) = 0 which unfortunately is not the case. That is the 
main reason why we abondon the use of d J 2 ~ 1 (m.j) measure 
in the sequel of this work. 

Example 2: Let's consider the frame of discernment 0 = 
{A, B, C, D, E}, Shafer’s model, and the following bba’s 



mi(.) 


mi (A) = 


0.6, mi 


(■ B) = 


mi(C) = 




mi(D) = 


m 1 (E) 


= 0.1 




m 2 (.) 


m 2 (0) = 


1 






m 3 (.) 


m 3 (0) = 


1 






nij- 1(-) 


mj- t(0) 


= 1 






TOj(-) 


rrij (A) = 


0.6, rrij 


(B) = 


nijiC) = 




nij(D) = 


m 3 {E ) 


= 0.1 





In this example, my(.) = mi(.), but nij(.) is quite different 
from the others bba’s i / 1 . The similarity measure 

d{~ 1 (m.j) between rrij(.), for j > 3 and the bba’s m,(.), 
i < j is d{~ 1 (m.j) = which shows a trend to 0.2 

when j increases. However in such case, one always gets 
using Dempster’s rule. Yager's rule, DP rule or PCR5 rule 
1 (nij) = 0. From such very simple example, one sees that 
one cannot detect the dissimilarity of m ;/ -(.) with a majority 
of quite distinct bba’s when d:! 2 'fnij) measure is used. This 
shows again that d J 2 ~ 1 (nij) is actually not very appropriate 
for measuring the similarity between a given bba m ;; ( . ) and a 
set of bba’s. Therefore we will only consider the measure of 
similarity (m^) in the sequel. 



2 Here for notation convenience, we use the usual vectorial notation mi and m 2 (with boldfaced letter) for representing the entire bba’s usually denoted 
mi(.) and 77t2(.). mi and m 2 are vectors of dimension 2 1 (_) x 1. We assume that the bba’s vectors are both ordered using the same order for their 
components. 
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For managing the computational burden in applications, a 
parameter n < j — 1 is introduced in the measure d{^ 1 (m.j) 
and we define the new measure: 

1 min(nj-l) 

4~Vj) = minf ■ _ n Y ( 10) 

' ' f=l 

The accuracy and the computational complexity of this 
similarity measure increases when n tends to j — 1. 

Let ’s consider a given discounting tolerance threshold oj,i 
in [0,1]. If d^ l ~ 1 ( m.j) > u>d, it indicates that the bba m ? (.) 
will be considered as not similar enough with respect to other 
bba’s and therefore the source Sj is considered as unreliable 
and must be discounted before entering in the fusion process. 
The unreliability of the source Sj may be caused by a fault 
of the sensor or unexpected noises, condition changes, etc. In 
such case, the bba rrij(.) needs to be discounted by formula 
(6). As proposed by Martin et al. in [5], the reliability factor of 

a V A 

the source Sj is chosen as a :j = (1 — d J n '(nij) ) where 
the parameter A is defined in the easiest way with A = 1. 
The larger dissimilarity leads to the less reliability factor. 
If 4 _1 ( m j) < Wd, it means that the dissimilarity between 
with other bba’s is acceptable, and there is no need to 
revise/discount mj(.) in such case. 

IV. Selection of combination rules 

After evaluating the reliability of the sources, we have to 
select a suitable combination rule. Dempster’s rule is known to 
offer pretty good performances when the combined bba’s are 
not in too high conflict, otherwise when the conflict becomes 
too large it is generally considered safer to use alternative 
rules like Yager’s rule, DP rule, and PCR5 rule. The following 
examples show the difference between the different approaches 
for the fusion of sources of evidences. 

Example 3: This is Zadeh’s example [10], Let’s consider 
0 = {A, B, C} with Shafer’s model and the following bba’s 

mi(.) : Wi(A) = 0.9, TOi(S) = 0.1 
m 2 (.) : m 2 (S) = 0.1 ,to 2 (C') = 0.9 

One sees that the two sources are in very high conflict 
because the total conflict is rri0 2 (0) = 0.99. Using Dempster’s 
rule, one gets surprisingly m(B) = 1 which is somehow 
conterintuitive since mi and m 2 both believe in B with a 
little chance, but the fusion result states that B is the only 
possible solution with certainty, which seems unreasonable 3 . 
If we use Yager’s rule, DP rule, and PCR5, one gets: 

• Yager’s rule: m(B) = 0.01, m(0) = 0.99 

• DP rule: m(B) = 0.01, m(A U B) = 0.09, 
m(A U C) = 0.81, m(B U C) = 0.09 

. PCR5: m(A) = 0.486, m(B ) = 0.028, m(C ) = 0.4860 

These results are more reasonable in some sense, but they 
are not the same. Yager’s rule transfers all the conflicting mass 
to total ignorance and produces the least specific result in the 



three rules. DP rule distributes the conflicting mass to the 
union of the involved sets, which makes the uncertainty of the 
result still very large. DP rule produces a less specific result 
than PCR5 but DP is a bit more specific than Yager’s rule. 
PCR5 provides the most specific result since A and C share 
the same bba whereas B keeps a very low belief assignment. 

Therefore, in order to avoid to get counterintuitive results, 
it is reasonable to use Yager’s rule, DP rule, or PCR5 than 
Dempster’s rule as soon as the level of conflict becomes large. 
The choice among Yager’s rule, DP rule, and PCR5 depends 
on the application and the computational resource one has. 
PCR5 is very appropriate if a decision has to be made because 
it provides the most specific solution, but PCR5 requires the 
most computational burden. Sometimes it better to get less 
specific result if we don’t need to take a clear/precise decision 
in case of high conflict between sources. In such case. Yager’s 
rule and/or DP rule can be used instead. When the level of 
conflict between two bba’s is low Dempster’s rule can be 
used since it offers a good compromise between computational 
complexity and the specificity of the result. 

Example 4: Let’s consider 0 = {A, B, C } and 

?rii(.) : toi(A) = 0.35, mi(B) = 0.3, mi (A U B) = 0.15, 
mi{C) = 0.2 

m 2 (.) : m 2 (A) = 0.35, m 2 (f?) = 0.3, m 2 (A U C) = 0.05, 
m 2 (A U B) = m 2 (C) = m 2 (0) =0.1 

TO3(.) : m^{A) = 0.3, m^{B) = 0.3, m^{A U B) = 0.2, 
mz(C) = m^{A U C) = 0.1 

The conflicts between each pair of bba’s are given by 
TO0 2 (0) = 0.455, TO0 3 (0) = 0.395, m| 3 (0) = 0.395. The 
levels of these conflicts are not too large according and the 
sequential combination m(.) = [[mi ©m 2 ] ©rrt 3 ](.) using the 
different rules yields 

• Dempster’s rule: rri(A) = 0.5868, m(B) = 0.3592, 
m(A U B) = 0.0202, m(C) = 0.0338 

• Yager’s rule: m(A) = 0.3105, m{B) = 0.243, m(C) = 
0.0555, m(A U B) = 0.097, m{A U C) = 0.0455, 
m(0) = 0.2485 

. DP rule: m{A) = 0.3255, m{B) = 0.2295, m(C) = 
0.0435, m(A U B) = 0.1975, m(A U C) = 0.0575, 
m(B U C) = 0.0345, m(0) = 0.112 

. PCR5: m(A) = 0.4889, m{B) = 0.3941, m{C) = 
0.0819, m{A U B) = 0.0268, m(A U C) = 0.0083 

All the rules provide reasonable results with assigning the 
largest belief to A, but Dempster’s rule produces the most 
specific result with a less computational effort. Dempster’s 
rule is thus well appropriate when is not too large. 

V. Adaptive combination of sequential evidence 

Here we are concerned with the real-time decision-making 
problem from the sequential acquisition of bba’s mi(.), 
m 2 (.),..., mjv(-) defined on a same frame 0 without any 



'More generally, one can show that Dempster's rule can become insensitive to the variation of input bba’s to combine - see [8], Vol. 1, Chap. 5. p. 114 for 
example. 



26 




Advances and Applications of DSmT for Information Fusion. Collected Works. Volume 4 



prior knowledge about reliability of each source. We start 
with trti(.). When tri2(.) is available, one combines it with 
TOi(.) by a suitable rule according to the value of ? 77 . 0 2 ( 0 ) 
without evaluating the reliability of the two sources. When 
rrij(.), for j > 3 becomes available at the time j, the reliability 
of the source Sj is evaluated and rrij(.) is discounted (if 
necessary) by the approach presented in section III. Before 
combining the discounted bba m'A.) (or to,(.) when no 
discounting occurs) with the last updated bba rn{ _ (.), the 
combination rule is selected according to the value of the 
conflict between rrij(.) and rnj - (.). We use a threshold 
If Ttipjj ( 0) < wg, Dempster’s rule is selected because it 
offers a good compromise between complexity and specificity. 
Otherwise, Yager’s rule, DP rule, or PCR5, are selected upon 
the actual application to avoid to get counterintuitive results. 

The tuning of thresholds Ud and ujq is not easy in general. 
If the thresholds are too large, one takes the risk to get 
counterintuitive results, whereas if they are set to too low 
values the non specificity of the result will become large 
and even will lead to decision-making under big uncertainty. 
Therefore, both thresholds u>d and W 0 need to be determined by 
accumulated experience depending on the actual application . 

VI. Numerical example 

Let us suppose a multisensor-based target identification 
system. From five independent sensors, the system collects five 
pieces of evidence sequentially (actually we consider here 2 
possible bba’s TOsa(-) and 77155 (.) for the fifth source). For 
decision-making in real-time, the combination result needs to 
be updated right after the new evidence arrives. The bba’s 
defined on the power set of 0 = {A, B,C} are as follows 

wi(.) : mi(A) = 0.8, 7771 (B) = O.1,toi( 0) = 0.1 
m 2 (.) : m 2 (A) = 0.4, m 2 {B) = 0.25, 

m 2 (C) = 0.2, ?n 2 (B U C) =0.15, 
m 3 (.) : m. 3 (B) = 0.9, 7773 (C) = 0.1, 

W4(.) : 771 . 4 ( 1 ?) = 0.45 , 7774 (C) = 0.45, m^{B U C) = 0.1, 
m 5 A{-) ■ m.5A{A) = 0.5, m 5 A(A U B) = 0.25, 
m.5A{C) = 0.1, m 5 A(A l J C) = 0.15, 
m 5B (.) : 777.55(B) = 0.5, m 5B {A U B) = 0.25, 

777 . 55 (C) = O.l.TnssCB UC) =0.15. 

The five pieces of evidence are combined sequentially, and 
the results are presented in Table 1. The chosen thresholds are 
u>d = 0.6, o >0 = 0.6 and n = 5. 

All the rules provide reasonable results when combining 
consistent bba’s ?Bi(.) and m 2 (.). The bba m 3 (.) is highly 
conflicting with 7774 (.) and m 2 (.). If there is no prior in- 
formation about the reliability of the sources, we evaluate 
the reliability of each source according to its variation with 
respect to the others. The average similarity distance between 
m 3 and mi, m 2 is so large that d^ _ 1 (m 3 ) > uid- Thus, 
777.3 (•) is considered unreliable. If we combine directly (without 
discounting) m 3 (.) with r?7i(.) using Dempster, Yager, DP 
or PCR5, one gets a high belief in B with all the rules. 



With the adaptive rule, the bba of m 3 (.) is discounted with 
the reliability factor a = 1 — d^ _1 (m 3 ) to get m 3 (.). The 
combination of m 3 (.) with r?7i(.) assigns now the highest 
belief in A. This adaptive method is helpful to deal with 
the high conflicts caused by the unreliability of the sources. 
The difference between 7774 (.) and 7?7i(.), m 2 (.), 777.3 (.) is 
below the tolerance threshold, but the value of 7770(0) between 
?7i4(.) and mf(.) is very large, and 7770(0) = 0.8334 > u®. 
The result of Dempster’s rule indicates that the most credible 
hypothesis is B, whereas A is not possible to happen, which 
is not reasonable. The results produced by Yager’s rule and 
DP rule selected in adaptive rule is full of uncertainty, and we 
even can’t make a clear decision from them because of their 
ways of distributing the mass of conflicting belief. We can get 
the specific output that most belief focuses on hypothesis A 
only if PCR5 is selected in the adaptive rule. As we can see, 
?77i(.) and 7?7 2 (.) strongly support the hypothesis A , whereas 
7773 (.) and 7774 (.) strongly support B. It is not easy to be 
sure what is the true hypothesis. The adaptive rule tends to 
preserve the earlier decision, since it assumes that 7?7i(.) and 
?77 2 (.) where totally reliable, and then ?7i3(.) is considered 
unreliable and thus discounted. When 7775 (.) is available, if 
7775 (.) strongly supports A as with 777 . 5 , 4 (.), the combination 
results of all the adaptive rules commit their highest belief 
in A. If m 3 (.) strongly supports B as with m 3 B{-), the 
combination results will change and assign the highest belief 
in B. The results produced by the adaptive rule with selecting 
combination rules between Dempster and PCR5 are always 
most specific, which is very useful and helpful for decision- 
making in real-time. The good performance of adaptive rules 
lies in the method of evaluating the reliability of sources and 
the way for automatically selecting suitable combination rules. 

VII. Conclusions 

An approach for adaptive combination of unreliable sources 
of evidence has been proposed in this paper for combining 
sequentially the sources without prior knowledge on their 
reliabilities. The reliability of each source is evaluated ac- 
cording to its similarity with respect to the others which 
is measured by an average distance of similarity. When a 
source is not reliable enough, its bba is discounted to diminish 
its influence in the fusion process and on decision-making. 
Before the fusion of the sources, the suitable combination 
rule is selected depending on the mass of conflicting belief 
TO 0 ( 0 ) and the compromise between the computational burden 
and the specificity of the result one wants to deal with. 
Whenever 777.0(0) is below the tolerance threshold, Dempster’s 
rule can be chosen as a good rule of combination for such a 
compromise. Otherwise, Yager’s rule, DP rule, or PCR5 must 
be selected to avoid to get counterintuitive results. The choice 
among these three rules depends on the application and the 
acceptable risk in decision-making errors. PCR5 rule is very 
appropriate to use in general for decision-making because 
it provides the most specific fusion results, but it requires 
more computational resources than other rules. If we want 
to keep uncertain results and don’t necessarily need a very 
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specific decision in case of high conflict between sources. 
Yager’s rule or DP rule can be selected instead. Our numerical 
example shows the interest of the proposed approach. The 
main difficulty however lies in the tuning of the thresholds ojd, 
ujj and the parameter n involved in its implementation. These 
parameters must be selected by experience depending on the 
application. This approach was based on Shafer’s model, but 
could be extended to other models proposed in DSmT. 
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TABLE I 

Combination results by different rules 







mf 








Dempster’s rule 


m(A) = 0.7826 
m(B) = 0.1413 
m(C ) = 0.0435 
m(B U C) = 0.0326 


m(B) = 0.9536 
m{c) = 0.0464 


m(B) = 0.9536 
m(C) = 0.0464 


m(B) = 0.9536 
m(C) = 0.0464 


m(B) = 0.9867 
m(C) = 0.0133 


Yager’s rule 


m(A) = 0.36 
m{B) = 0.065 
m(C ) = 0.02 
rri(B U C) = 0.015 
m(0) = 0.54 


m(B) = 0.558 
m(C) = 0.0575 
m(0) = 0.3845 


m(B) = 0.4799 
m(C) = 0.2046 
m(B U C) = 0.0385 
m(0) = 0.2770 


m(A) = 0.1385 
m{B) = 0.1296 
m(A U B) = 0.0692 
m(C) = 0.0885 
m(A U C) = 0.0415 
m(&) = 0.5327 


m(B) = 0.5993 
m(A U B) = 0.0692 
m(C) = 0.0827 
m(B U C) = 0.0473 
m(0) = 0.2015 


DP rule 


m(A) = 0.36 
m(B) = 0.065 
m(A U B) = 0.24 
m(C ) = 0.02 
m(A U C) = 0.16 
m(B U C) = 0.035 
m(0) = 0.12 


m(B) = 0.414 
m(A U B) = 0.324 
m(C) = 0.0335 
m(A U C) = 0.036 
m(B U C) = 0.0245 
m(0) = 0.168 


m(B) = 0.4925 
m(C) = 0.1249 
m(B U C) = 0.2206 
m(0) = 0.1620 


m(A) = 0.081 
m(B) = 0.1783 
m(A U B) = 0.2868 
m(C) = 0.1026 
m(A U C) = 0.0867 
m{B U C) = 0.0493 
m(0) = 0.2153 


m(B) = 0.6897 
m{A U B) = 0.0405 
m(C ) = 0.0695 
m(B U C) = 0.1691 
m(0) = 0.0312 


PCR5 rule 


m(A) = 0.7734 
m(B) = 0.1273 
m(C) = 0.0653 
m.(B U C) = 0.0340 


m(A) = 0.3902 
m(B) = 0.5814 
m(C) = 0.0284 


m(A) = 0.1942 
m(B) = 0.5733 
m(C) = 0.2245 
m(B U C) = 0.008 


m(A) = 0.4025 
m(B) = 0.4154 
m(A U B) = 0.0296 
m(C) = 0.1346 
m(A U C) = 0.0178 
m(B U C) = 0.0001 


m(A) = 0.1049 
m(B) = 0.7182 
m(A U B) = 0.0296 
m(C) = 0.1334 
m(B U C) = 0.0139 


d J n ~ 1 (m j )(n = 5) 


0.3813 


0.6601 


0.4994 


0.4115 


0.4137 


Dempster's rule 
with discounting 


m(A) = 0.7826 
m(B) = 0.1413 
m(C) = 0.0435 
m.(B U C) = 0.0326 


m{A) = 0.7216 
m(B) = 0.2046 
m{c) = 0.0437 
m(B U C) = 0.0301 


m(B) = 0.7565 
m(C) = 0.2255 
m(B U C) = 0.018 


m(B) = 0.7608 
m(C) = 0.2392 


m(B) = 0.9194 
m(C) = 0.0770 
m{B U C) = 0.0036 


Adaptive rule 
(Dempster&Yager) 


m(A) = 0.7826 
m(B) = 0.1413 
m(C) = 0.0435 
m(B U C) = 0.0326 


m(A) = 0.7216 
m(B) = 0.2046 
m{c) = 0.0437 
m(B UC) = 0.0301 


ni(B) = 0.1261 
m(C) = 0.0376 
m(B U C)= 0.0030 
m(0) = 0.8334 


m(A) = 0.4758 
m(B) = 0.0368 
m(A U B) = 0.2379 
m(C) = 0.1067 
m(A U C) = 0.1427 


m(B) = 0.5550 
m(A U B) = 0.2172 
m(C) = 0.0970 
m(BuC) =0.1308 


Adaptive rule 
(Dempster&DP) 


m(A) = 0.7826 
m(B) = 0.1413 
m(C) = 0.0435 
m(B U C) = 0.0326 


m(A) = 0.7216 
m(B) = 0.2046 
m(C) = 0.0437 
m(B U C) = 0.0301 


m(B) = 0.1261 
m(A U B) = 0.3247 
m(C) = 0.0376 
m(A U C) = 0.3247 
m(B U C) = 0.1147 
m(0) = 0.0722 


m(A) = 0.6232 
m(B) = 0.0765 
m(A U B) = 0.126 
m(C) = 0.0988 
m(A U C) = 0.0755 


m(A) = 0.1062 
m(B) = 0.5844 
m(A U B) = 0.1298 
m\c ) = 0.1429 
m(B U C) = 0.0367 


Adaptive rule 
(Dempster&PCR5) 


m(A) = 0.7826 
m(B) = 0.1413 
m(C ) = 0.0435 
m(B U C) = 0.0326 


m(A) = 0.7216 
m(B) = 0.2046 
m(C) = 0.0437 
m(B U C) = 0.0301 


m{A) = 0.4634 
m(B) = 0.2975 
m(C) = 0.2273 
m(B U C) = 0.0118 


m(A) = 0.7526 
m(B) = 0.1395 
m(C) = 0.1079 


m{A) = 0.2562 
m(B) = 0.6116 
m(C) = 0.1283 
m{B U C) = 0.0039 
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Fusion of Sources of Evidence 
with Different Importances and Reliabilities 

F. Smarandache 
J. Dezert 
J.-M. Tacnet 



Abstract — This paper presents a new approach for 
combining sources of evidences with different impor- 
tances and reliabilities. Usually, the combination of 
sources of evidences with different reliabilities is done by 
the classical Shafer’s discounting approach. Therefore, 
to consider unequal importances of sources, if any, a 
similar reliability discounting process is generally used, 
making no difference between the notion of importance 
and reliability. In fact, in multicriteria decision con- 
text, these notions should be clearly distinguished. This 
paper shows how this can be done and we provide simple 
examples to show the differences between both solutions 
for managing importances and reliabilities of sources. 
We also discuss the possibility for mixing them in a 
global fusion process. 

Keywords: Information fusion, DSmT, discounting, 
importance, reliability, AHP. 

1 Introduction 

In many real-life fusion problems, one has to deal 
with different sources of information arising from hu- 
man reports, artificial intelligence experts systems 
and/or physical sensors. The information are usually 
imprecise, uncertain, incomplete, qualitative or quanti- 
tative and possibly conflicting. The task of information 
fusion is to combine all the information in such a way 
that one has a better understanding and assessment 
of the situation of the complex problem under consid- 
eration for decision-making support. Several theoret- 
ical frameworks have been proposed in the literature 
(Probability theory, Possibilities theory, Imprecise PT, 
etc) but the most appealing ones are the theories of 
belief functions, originally known as Dempster-Shafer 
Theory (DST) [8] and then extended and refined in 
Dezert- Smarandache Theory (DSmT) [9] for dealing 
with qualitative information, for fusioning highly con- 
flicting sources of evidences, for conditioning evidences, 
etc. Aside the choice of the “best 11 rule of combina- 
tion of sources of evidences characterized by their be- 



Originally published as Smarandache F., Dezert J., Tacnet J.-M., 
Fusion of sources of evidence with different importances and 
reliabilities, Fusion 2010, Edinburgh, Scotland, UK, 26-29 July 2010, 

and reprinted with permission. 

lief functions, more specifically by their basic belief as- 
signments (bba’s), or belief masses, the very important 
problem concerns the possibility that sources involved 
in the fusion process may not have the same reliability, 
neither the same importance. The reliability can be 
seen as an objective property of a source of evidence, 
whereas the importance of a source is a subjective prop- 
erty of a source expressed by the fusion system designer. 

The reliability of a source represents its ability to pro- 
vide the correct assessment /solution of the given prob- 
lem. The importance of a source represents somehow 
the weight of importance granted to the source by the 
fusion system designer. The reliability and importance 
represent two distinct notions and the fusion process 
must be able to deal with these notions. We show in 
this paper how this can be done efficiently through two 
discounting techniques using Proportional Conflict Re- 
distribution rules no 5 or no 6 (PCR5 or PCR6) de- 
veloped in DSmT framework. We will show also that 
such solution cannot be used in DST framework us- 
ing Dempster’s rule of combination because Dempster’s 
rule doesn’t respond to our new importance discounting 
(it only responds to reliability discounting 1 ). 

The importance of a source is particularly crucial 
since it is involved in multi-criteria decision making 
(MCDM) problems, like in the Analytic Hierarchy Pro- 
cess (AHP) developed by Saaty [6, 7]. That’s why it 
is fundamental to show how the importance can be ef- 
ficiently managed in evidential reasoning approaches, 
in particular in DSmT. The fusion system designer is 
still free to make no differences between importance and 
reliability and use the classical discounting technique. 
In general however, one should consider the importance 
and the reliability as two distinct notions and thus they 
have to be processed in different ways. This is the pur- 
pose of this paper. The application of this technique 
in DSmT-AHP is presented in [2] and an application of 
both DSmT and AHP for risk expertise and prevention 
in mountains has been introduced by Tacnet in [11, 12] 

1 Known as the classical Shafer’s discounting, see [8]. 
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and works are still under progress in this field. 

This paper is organized as follows. After a brief re- 
minder of basics of DSmT for information fusion and 
its main fusion rule in section 2, we present the clas- 
sical discounting technique for combining sources with 
different reliabilities in section 3. In section 4, we intro- 
duce in a new solution for taking into account the pos- 
sible different importances of sources in the fusion pro- 
cess. Section 5 provides simple examples to show and 
compare the results obtained from the two discounting 
approaches. In section 6 we discuss the more general 
problem where one needs to deal with both reliability 
and importance at the same level in the fusion process. 
Conclusions and perspectives are given in section 7. 

2 Basics of DSmT 

Let 0 = {t?i , 6*2 , • • • , 0 n } be a finite set of n elements 
Oi, i = 1 ,...,n assumed to be exhaustive. 0 corre- 
sponds to the frame of discernment of the problem un- 
der consideration. In general (unless introducing some 
integrity constraints), we assume that elements of 0 
are non exclusive in order to deal with vague/fuzzy and 
relative concepts [9], Vol. 2. This is the so-called free- 
DSm model. In DSmT framework, there is no need in 
general to work on a refined frame consisting in a dis- 
crete finite set of exclusive and exhaustive hypotheses 2 
because DSm rules of combination work for any models 
of the frame, i.e. the free DSm model (no exclusive con- 
straint between Oi, Shafer’s model (all Oi are exclusive) 
or any hybrid model (only some Oi are truly exclusive). 
The power set 2 e is defined as the set of all proposi- 
tions built from elements of 0 with U [8]; 0 generates 
2 e under U. The liyper-power set (Dedekind’s lattice) 
D e is defined as the set of all propositions built from 
elements of 0 with U and H; 0 generates D e under U 
and fl, see [9] Vol. 1 for many detailed examples. The 
super-power set (Boolean algebra) S e is defined as the 
set of all propositions built from elements of 0 with U 
and fl and complement c(.); 0 generates S e under U, fl 
and c(.), see [9] Vol. 3. S e can be seen as the minimal 
refined frame of 0. For notation convenience, we use 
the generic notation G e to represent the fusion space 
under consideration depending on the application and 
the underlying model chosen for the frame 0; which 
can be either G e can be either 2 e , D e or S e . In DST 
framework, G e = 2®, whereas in DSmT we usually 
work with G e = D 0 . 

A (quantitative) basic belief assignment (bba) ex- 
pressing the belief committed to the elements of G e 
by a given source/body of evidence is a mapping func- 
tion ro(-): G e — > [0,1] such that: to( 0) = 0 and 
S.4eG e m (A) = 1. Elements A € G e having m(A) > 0 
are called focal elements of the bba m(.). The general 
belief and plausibility functions are defined respectively 

2 Referred as Shafer’s model in the literature. 



in almost the same manner as Shafer in [8], i.e. 

Bel(A) = J2 m ^ W 

BgG 0 ,BC.A 

Pl(A) = J2 (2) 

BeG e ,snA/0 

In DSmT, the Proportional Conflict Redistribution 
Rule no. 5 (PCR5) has been proposed as a serious 
alternative of Dempster’s rule [8] for dealing with 
conflicting belief functions. It has been also clearly 
shown in [9], Vol. 3, chap. 1 that Smets’ rule 3 is not 
so efficient, nor cogent because it doesn’t respond to 
new information in a global or in a sequential fusion 
process. Indeed, very quickly Smets’ fusion result 
commits the full mass of belief to the empty set!!! 
Therefore in applications, some ad-hoc numerical 
techniques must be used to circumvent this serious 
drawback. Such problem doesn’t occur with PCR5 
rule. By construction, other well-known rules like 
Dubois & Prade, or Yager’s rule, and contrariwise to 
PCR5, increase the non-specificity of the result. An 
introduction to DSmT and PCR5 fusion rule with 
justification and several examples can be found in [9], 
Vol. 3, Chap. 1, freely downloadable from the web. 

Definition of PCR5 (for two sources): Let’s mi(.) 
and m2(.) be two independent 4 bba’s, then the PCR5 
rule of combination for two sources of evidence is de- 
fined as follows (see [9], Vol. 2 for details, justification 
and examples): m.pcRsifb) = 0 and MA G G e \ {0} 

mpcm{A) = ^2 mi(Ai)m 2 (V 2 )-|- 

x 1 ,x 2 eG e 
x 1 nx 2 = a 

y- r mi(A) 2 m 2 (A) m 2 (A) 2 m 1 (X) 

mi(A) + m 2 (X) m 2 (A)+mi(V) 

X gg 
in A=0 

All fractions in (3) having zero denominators are 
discarded. In DSmT, we consider all propositions/sets 
in a canonical form. We take the disjunctive normal 
form, which is a disjunction of conjunctions, and it is 
unique in Boolean algebra and simplest. For example, 
X = 4nBn(dUBUG) it is not in a canonical form, 
but we simplify the formula and X = A n B is in a 
canonical form. Like most of fusion rules 5 , PCR5 is not 
associative and the optimal fusion result is obtained 
by combining the sources altogether at the same time 
when possible. Some of PCR5 properties can be 
found in [1] and it allows non-Bayesian reasoning. An 
extension of PCR5 for combining qualitative bba’s can 

3 i.e. the non normalized Dempster’s rule. 

4 i.e. each source provides its bba independently of the other 
sources. 

5 Except Dempster’s rule, and conjunctive rule in free DSm 
model. 
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be found in [9], Vol. 2 & 3. 

Basically, the idea of PCR5 is to transfer the conflict- 
ing mass only to the elements involved in the conflict 
and proportionally to their individual masses, so that 
the specificity of the information is entirely preserved 
through this fusion process. For example: consider two 
bba’s mi(.) and TO 2 (.), A n B = 0 for the model of 0, 
and mi (A) = 0.6 and 1712 (B) = 0.3. With PCR5 the 
partial conflicting mass mi(A)rri 2 (B ) = 0.6 • 0.3 = 0.18 
is redistributed to A and B only with respect to the 
following proportions respectively: xa — 0.12 and 

Xb = 0.06 because the proportionalization requires 



thus 

f x p A cm = 0.6 • 0.018 = 0.0108 
) x ^ c 2 R6 = 0.3 • 0.018 = 0.0054 
{x^% R6 = 0.1 -0.018 = 0.0018 

and therefore with PCR6, one gets finally the following 
redistributions to A and B: 

x PCm = o 0108 

x Pcm = x pqm + x pcm = 0 0054 + 0 .0018 = 0 .0072 

From the implementation point of view, PCR6 is much 
more simple to implement than PCR5 (see Appendix). 



xa xb m.\(A)m2(B) 0.18 

m\(A) rri 2 (B) m\(A) + m 2 (B) 0.9 



Variant of PCR5 (PCR6): The extension and a vari- 
ant of (3), called PCR6 has been proposed by Mar- 
tin and Osswald in [9], Vol. 2, for combining s > 2 
sources and for working in other fusion spaces is pre- 
sented in [9]. For two sources, PCR6 coincides with 
PCR5. The difference between PCR5 and PCR6 lies 
in the way the proportional conflict redistribution is 
done as soon as three or more sources are involved in 
the fusion. For example, let’s consider three sources 
with bba’s m\(.), m^i-) and m 3 (.), A fl B = 0 for the 
model of the frame 0, and m\(A) = 0.6, 1112 (B) = 0.3, 
m^(B) =0.1. With PCR5 the partial conflicting mass 
m\(A)m 2 (B)ms(B) = 0.6 • 0.3 • 0.1 = 0.018 is redis- 
tributed back to A and B only with respect to the 
following proportions respectively: x RCR5 = 0.01714 
and Xg CR5 = 0.00086 because the proportionalization 
requires 



„PCR5 



„PCR5 



mi (A) m2(B)m^(B) 



m\ (A)m2 (B)m.3 (B) 
mi(A) + m. 2 (B)m 3 (B) 



that is 

~,PCR5 nPCRS 
X A _ X B 

0.6 “ 0.03 



0.018 
0.6 + 0.03 



0.02857 



thus 



x RCR5 = 0.60 • 0.02857 « 0.01714 
x RCR5 = 0.03 • 0.02857 « 0.00086 



With the PCR6 fusion rule, the partial conflicting mass 
mi(A)m 2 (B)m 3 (B) = 0.6 • 0.3 • 0.1 = 0.018 is redis- 
tributed back to A and B only with respect to the fol- 
lowing proportions respectively: x^ CR6 = 0.0108 and 
X PCR6 _ 0.0072 because the PCR6 proportionalization 
is done as follows: 



„PCR6 

X A 

m 1 (A) 
that is 



r PCR6 

L B,2 



yPCRQ 
l B, 3 



to 1 (A)ni2 (B)m.3 (B) 



m 2 (B ) ms(B) m\(A) + m 2 (B) + m 3 (B) 



3 Reliability discounting 

Reliability refers to information quality while impor- 
tance refers to subjective preferences of the fusion sys- 
tem designer. The reliability of a source represents its 
ability to provide the correct assessment/solution of 
the given problem. It is characterized by a discount- 
ing reliability factor, usually denoted a in [0, 1], which 
should be estimated from statistics when available, or 
by other techniques [3]. This reliability factor can be 
context-dependent. For example, if one knows that 
some sensors do not perform well under bad weather 
conditions, etc, one will decrease the reliability factor 
of information arising from that source accordingly. By 
convention, we usually take a = 1 when the source is 
fully reliable and a = 0 if the source is totally unre- 
liable. Reliability of a source is generally considered 6 
through Shafer’s discounting method [8], p. 252, which 
consists in multiplying the masses of focal elements by 
the reliability factor a, and transferring all the remain- 
ing discounted mass to the full ignorance 0. When 
a < 1, such very simple reliability discounting tech- 
nique discounts all focal elements with the same factor 
a and it increases the non specificity of the discounted 
sources since the mass committed to the full ignorance 
always increases. Mathematically, Shafer’s discounting 
technique for taking into account the reliability factor 
a € [0, 1] of a given source with a bba to(.) and a frame 
0 is defined by: 

f m a (X) = a ■ m(X ), for 1/0 

(?tIq,(0) = a • m(0) + (1 — a) 

4 Importance discounting 

The importance of a source is not the same as its re- 
liability and it can be characterized by an importance 
factor, denoted (3 in [0,1]. (3 factor represents some- 
how the weight of importance granted to the source by 
the fusion system designer. The choice of (3 is usually 
not related with the reliability of the source and can be 
chosen to any value in [0, 1] by the designer for his/her 



„PGfl6 r PCR6 PCR6 
X A _ X B,2 _ X B , 3 

0.6 “ 0.3 “ 0.1 



0.018 

0.6 + 0.3 + 0.1 



®M°re sophisticated methods have been also proposed, see 
0.018 [ 4 ^ 5 ] f or example. 
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own reason. By convention, the fusion system designer 
will take (3 = 1 when he/she wants to grant the max- 
imal importance of the source in the fusion process, 
and will take (3 = 0 if no importance at all is granted 
to this source in the fusion process. Typically, if one 
has a pool of experts around a table to take important 
decision, say politicians, scientific researchers, military 
officers, etc, it is possible that one wants to grant more 
importance to the voice of a given politician (say the 
President) rather than to a military officers or a scien- 
tific researcher, even if the scientific researcher is more 
reliable in his expertise field than other people. Such 
situations occur frequently in real-life problems. The 
fusion designer must be able to deal with importance 
factors in a different way than with reliability factors 
since they correspond to distinct properties associated 
with a source of information. 

The main question we are concerned in this paper is 
how to deal with different importances of sources in the 
fusion process in such a way that a clear distinction is 
made/preserved between reliability and importance ? 

Our preliminary investigations were based on the 
self/auto-combination of the sources. For example, 
if one has the importances factors (3i = 0.7 for the 
source Si and (3 2 = 0.3 for the source S 2 , one could 
imagine to combine 7 times the bba mi(.) with it- 
self, combine 3 times the bba m 2 (.) with itself, and 
then combine the resulting auto-fusioned bba’s because 
such combination would reflect somehow the relative 
importance of the source in the fusion process since 
(3\/(3 2 = 0.7/0. 3 = 7/3. Actually such approach is very 
disputable and cannot be used satisfactorily in practice 
whatever the fusion rule is adopted. It can be easily 
shown that the auto-conflict tends quickly to 1 after 
several auto- fusions [3]. In other words, the combina- 
tion result of N x (3\ bba’s mj(.) with M x (3 2 bba’s 
W 2 (.) is almost the same for any N and M sufficiently 
large, so that the different importances of sources are 
not well preserved in such approach. The numerical 
complexity of such method must be pointed out since 
it would require to compute possibly many auto- fusions 
of each source which is a very time-consuming computa- 
tional task. For example, if (3\ = 0.791 and (3 2 = 0.209, 
it would require to combine at least 791 auto-fusions of 
mi(.) with 209 auto-fusions of m 2 (.) !!! 

In this paper, we propose a better solution to con- 
sider importances of sources. Our new approach can 
be considered as the dual of Shafer’s discounting ap- 
proach for reliabilities of sources. The idea was origi- 
nally introduced briefly by Tacnet in [9], Vol.3, Chap. 
23, p. 613. It consists to define the importance dis- 
counting with respect to the empty set rather than the 
total ignorance O (as done by Shafer in reliability dis- 
counting presented in section 3) . Such new discounting 
technique allows to deal easily with sources of different 
importances and is also very simple to use as it will be 
shown. 



Definition (importance discounting): We define the 
importance discounting of a source having the impor- 
tance factor (3 and asociated bba m(.) by 

mp(X) = (3 ■ m(X), for A ^ 0 
m^) = /3 ■ m(0) + (1 — (3) 

Note that with this importance discounting approach, 
we allow to deal with non- normal bba since mp{$) > 0. 
The interest of this new discounting is to preserve the 
specificity of the primary information since all focal 
elements are discounted with same importance factor 
and no mass is committed back to partial or total 
ignorances. Working with positive mass of belief on 
the empty set is not new and has been introduced in 
nineties by Smets in his transferable belief model [10]. 
Here we use the positive mass of the empty set as an 
intermediate/preliminary step of the fusion process. 
Clearly when (3 = 1 is chosen by the fusion designer, it 
will mean that the source must take its full importance 
in the fusion process and so the original bba m(.) is 
kept unchanged. If the fusion designer takes (3 = 0, one 
will deal with mp(%) = 1 which must be interpreted 
as a fully non important source. m(0) > 0 is not 
interpreted as the mass committed to some conflicting 
information (classical interpretation), nor as the mass 
committed to unknown elements when working with 
the open- world assumption (Smets interpretation), but 
only as the mass of the discounted importance of a 
source in this particular context. 

Before going further, it is worth to note that 
Dempster’s rule cannot deal properly with importance 
discounted bba’s proposed in (5) because our impor- 
tance discounting technique preserves the specificity 
of the primary information and Dempster’s rule does 
not make a difference in results when combining m i(.) 
with W 2 (.) or when combining m/ 3 15 ^i(.) with m/ 3 2 ^i(.) 
due to the way of processing the total conflicting mass 
of belief. This can be stated as the following theorem: 

Theorem 1: Dempster’s rule is not responding to 
the discounting of sources towards the emptyset. 
Proof: Let m/.) and m 2 (.) be two bba’s defined 
on the fusion space G e = {Ai, A 2 , . . . , A„}. Let 
mi(Xi) = at for all i, with x ai = 1, and all 
at in [0,1], and let m 2 (Xi) = bi for all i, with 
Yh=i^ = 1) and all b t in [0,1]. mi(0) = m 2 (0) = 0. 
After discounting both m\(.) and m 2 (.) towards the 
emptyset with (3\ and respectively (3 2 in [0,1], we 
get: (/ 3i)m 1 (X i ) = (/ 3i)a z for all i, with X//=i a* = 1, 
and all a/ in [0,1], also (/3i)mi(0) = 1 — (3\, and 
( (3 2 )m 2 (Xi ) = {P 2 )bi for all i, with YZ=i b i = 
and all bi in [0,1], also (f3 2 )m\((b) = 1 — (3 2 . If we 
apply the conjunctive rule to mi(.) and m 2 (.) we get: 
r mi 2 (Xi) = Ci , with X)™ =1 Cj = 1 and Cj in [0, 1], where 
some Xi could be empty intersections. Suppose the 



32 




Advances and Applications of DSmT for Information Fusion. Collected Works. Volume 4 



non-empty resulted sets after applying the conjunctive 
rule are: X ix , X ip . Then Dempster’s rule gives 
m DS (X ik ) = mi 2 (X ik )/(m 12 (X il )+. . .+m 12 {X ip )) 1 for 
1 < k < p. If we apply the conjunctive rule to (pi)nii (.) 
and (P 2 )m 2 (.) we get: (/?i)(/? 2 )mi 2 (Xj) = (0i)(j3 2 )ci, 
with (/?i )(/3 2 )cj in [0,1], where some Xi could be 
empty sets, and (/?i)(/3 2 )mi 2 (0 7 ) = 1 - (/?i) (/3 2 ). Now 
Dempster’s rule normalizes the conjunctive result of 
non empty sets by dividing the mass of each nonempty 
set by the sum of all non-empty sets. The non-empty 
resulted sets after applying the conjunctive rule are the 
same: X h , . .., X ip . Then: (Pi)(P 2 )m D s(X ik ) = 
(/3i)(/3 2 )fTH 2 (^i fc )/((/3i)(/3 2 )mi 2 (Xj 1 ) + ... + 

(Pi)mm 12 (X ip )) = mi 2 (X i J/(mi 2 (.X'j 1 ) + ... + 
1 ^ 12 (Xi p )) = rriDs(Xi k ) since the whole fraction is sim- 
plified by (Pi )(/? 2 ), for 1 < k < p. Hence Dempster’s 
rule is not responding to the discounting of sources 
towards the empty set. 

From Theorem 1, one understands why such impor- 
tance discounting technique has never been proposed 
and used in DST framework and this explains why the 
classical Shafer’s discounting technique (the reliability 
discounting) has only been largely adopted so far. 
By using Dempster’s rule, the fusion designer has no 
other choice but to consider importance and reliability 
as same notions! As it will be shown, the DSmT 
framework with PCR5 (or PCR6) rule and the impor- 
tance discounting technique proposed here provides 
an interesting and simple solution for the fusion of 
sources with different importances which makes a clear 
distinction between importances and reliabilities of 
sources. 

Fusion of importance discounted bba’s: Based on 
this new discounting technique, it is however very sim- 
ple to adapt PCR5 or PCR6 fusion rules for combining 
the s > 2 discounted bba’s associated with each source 
i, i = 1, 2, . . . s. It suffices actually to consider the fol- 
lowing extension of PCR5, denoted PCR50 and defined 
by: 

• For two sources (s = 2): VA £ G° (A may be the 
empty set too) 



For s > 2 sources: VA £ G'° (A may be the empty 
set too) 



(A) = mi 2 ... s (A)+ 

E 



2<t<s 

l<ri,...,r t <s f . . x — 

l<r 1 <r 2 <...< rt - 1 <(r t =s) 

{ii,...,is}e'P s ({l,...,s}) 



E 

X 32 EG 0 



(i r^im iki (Ap) ■ [nu(ni;, ri _ 1+1 m itl ft,)] 



(ni-u™ ifcl (A)) + EU(mu 



i= 2 UU- ! =r ,_ 1 + l ' 



( 7 ) 



where i, j, k, r, s and t in (7) are integers. 
mi 2 ... s (A) = is the 

n s i= 1 Xi=A 

conjunctive consensus on A between s sources 
and where all denominators are different from 
zero. If a denominator is zero, that fraction is 
discarded; V k ({l, 2, . . . , n}) is the set of all subsets 
of k elements from (1,2, ...,n} (permutations 
of n elements taken by k), the order of elements 
doesn’t count. 



A similar extension can be done for the PCR6 for- 
mula for s > 2 sources given in [9], Vol. 2. More pre- 
cisely for any A in G ,e (A may be the empty set too) 
we define: 

mpcm^A) = mi 2 ... s (A)+ 

S — 1 

E E E 

A'i,X 2 ,...,X s _ieG e fc=1 (ti,i2,...,ts)€P(l,2,...,s) 
Xi^A,ie{l,2,...,s-l} 

(n®r 1 1 A' i )nA=0 

Kj (A) + m l2 ( A) + . . . + m ik (A)] x 

njU mg (A) ripEi m lp (X p ) 

T,U m iM) + K=l+i m ^x P ) 

where P(l,2,...,s) is the set of all permutations of 
the elements {1,2,..., s}. It should be observed that 
X \ , X 2 , ..., X s _i may be different from each other, 
or some of them equal and others different, etc. 



E 



Ai,A 2 €G e 
AiDA 2 — A 



'mpcR5< ll (A) = 

\ ' r mi(A) 2 m 2 (X) 
m i(A) + m 2 (X) 

A GCx 
AflA=0 



m 1 (X 1 )m 2 (X 2 )+ 

m 2 (A) 2 mi(X) 
m 2 (A) + mi(X) 



7 i.e. the absolute empty set, not that resulted from set inter- 
sections which are empty. 



As a particular case for s = 3 sources, one gets for 
any A in G e ( A may be the empty set too): 



] (6) fnpcmn,(A) = m\ 2 z(A) + 



E E 

x,yeG e (ii,i 2 ,i3)eP(l,2,3) 
x^a,y^a 

XnYnA=0 



mq (A) 2 m.j 2 ( X)m i3 ( Y ) 
mq (A) + m i2 (X) + m i3 (Y) 

m h ( A)m i2 (A)m i3 (A) 



[mu (A) + m i2 (A)] 



m il (A) + rm 2 (A) + m i3 (X ) 



(9) 
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where 777.123(4!) is the mass of the conjunctive consensus 
on A and P( 1, 2, 3) is the set of all permutations of the 
elements {1,2,3}. It should be observed that X may 
be different or equal to Y. 

The difference between formulas (3) and (6) is that 
mpcR. 5(0) = 0 whereas m PC R 5 0 (0) > 0. Of course, 
since we usually need to work with normal bba’s for 
decision-making support, the result mpcR 5 e {-) , or 
mpcR6 e {-), of the fusion of discounted masses m^}.) 
will be normalized by redistributing the mass of belief 
committed to the empty set to the other focal elements 
and proportionally to their masses (see next example). 

5 Example 

For convenience and simplicity, and due to space lim- 
itation constraint, we give a very simple example work- 
ing on the classical power set 2 e since most of read- 
ers familiar belief functions usually work with this fu- 
sion space. Example 1: Let’s consider 0 = {A,B}, 
Shafer’s model, and two sources with respectively bba’s 
mi(.) and m2(.) given by mi (A) = 0.8, m\(B) = 0.2 
and 777-2(4!) = 0.4, m 2 (P) = 0.6. 

• Case 1 (no importance discounting): Let’s con- 
sider that /5i = 1 and P 2 = 1, be. the sources 
must have the same maximal importance in the 
fusion rule. In that case, one gets: mp 1 (.) = 
mi(.) and m.p 2 {.) = m^i-) and the bba’s are ac- 
tually not discounted. The conjunctive rule gives 
^12(4!) = 0.32, m\ 2 {B) = 0.12 and the mass 
77112(4! fl B = 0) = 0.56 is redistributed back to 
4l and B proportionally to their masses following 
the PCR5 principle explained in section 2. We get 
the following result: 





«01=l(-) 


"02 = 1 (•) 


77112 (•) 


mpcRs{.) 


0 


0 


0 


0.56 


0 


A 


0.8 


0.4 


0.32 


0.64 


B 


0.2 


0.6 


0.12 


0.36 



Table 1: PCR5 fusion of m f 3 1= i(.) with mp 2 =\{.). 



• Case 2 (with importance discounting): Let’s take 
now the importances factors P\ = 0.2 and @2 = 0.8 
(note that in general we don’t need to force the 
sum of (3i to be one, unless one wants to deal with 
relative importances between sources). Applying 
importance discounting technique and normaliza- 
tion of mpcR, 5 0 , denoted Wpci?5 0 (-)> one gets: 





mp 1 =o.2(.) 


mp 2= o.$(.) 


mi 2 (.) 


_ norm / \ 

m PCR5 u (-) 


0 


0.80 


0.20 


0.9296 


0 


A 


0.16 


0.32 


0.0512 


0.43 


B 


0.04 


0.48 


0.0192 


0.57 



Table 2: PCR5 fusion of m^ 1= 0.2O) with mp 2 -o.8{-)- 



Clearly, one sees in Table 2 the strong impact of the 
importance discounting on the result with respect 
to what we obtain in Table 1 (i.e. without impor- 
tance discounting). Note also that the difference 
is very different to what we would have obtained 
by taking a.\ = 0.2 and 0:2 = 0.8 and using the 
reliability discounting approach as seen in Table 3. 





?7t ai= 0.2(.) 


m a 2 =o. 8(.) 


m 12 (.) 


m-PCRsi-) 


0 


0 


0 


0.0896 


0 


A 


0.16 


0.32 


0.3392 


0.3698 


B 


0.04 


0.48 


0.4112 


0.4702 


aub 


0.80 


0.20 


0.1600 


0.1600 



Table 3: PCR5 fusion of m ai — 0.2O) with m a2 —o.s(-)- 

By comparing Table 2 with Table 3, one sees the 
clear difference in results obtained by these two 
discounting techniques which is normal. 

6 Reliability and importance 

In this section, we examine the possibility to take into 
account both the reliabilities 07 and the importances 
Pi of given sources of evidence characterized by their 
bba’s rn,(.), i = 1,2, ... ,s. The main question is how 
to deal with these two distinct discounting factors since 
in general, but when a* = Pi = 1, the reliability and 
importance discounting approaches do not commute. 
Indeed, it can be easily verified (see in next example) 
that m ai)i g 4 (.) 7^ mpi,ai{-) whenever 07 7^ 1 and Pi 7^ 1. 
TO a»,/3»(-) denotes the reliability discounting of TOj(.)by 
a.i followed by the importance discounting of m ai (.) 
by Pi which explains the notation 07 , Pi used in index. 
Similarly, denotes the importance discounting 

of ?rq(.) by /3, followed by the reliability discounting of 
in pi ( . ) by a,;. To deal both with reliabilities and impor- 
tances factors and because of the non commutativity of 
these discountings, we propose to proceed the fusion of 
the sources in a three-steps process as follows: 

Method 1: Step 1: Apply reliability and then impor- 
tance discountings to get rn ctij p i (.), i = l,...,s and 
combine them with PCRbq or PCP60 and normalize 
the resulting bba; Step 2: Apply importance and then 
reliability discountings to get mp itCei (.),* = l,...,s and 
combine them with PCRbq or PCP60 and normalize 
the resulting bba; Step 3 (mixing/averaging): Combine 
the resulting bba’s of Steps 1 and 2 using the arithmetic 
mean operator 8 . 

Method 2: Another simplest method which could be 
useful for intermediate traceability in some applications 
would consist to change Steps 1 & 2 by Step V: Apply 
reliability discounting only to get m ai {.) and combine 
them with PCR5 or PCR6; Step 2’: Apply importance 
discounting only to get mp i (.) and combine them with 

8 Other combination rules could be used also like PCR5 or 
PCR6, etc., but we don’t see solid justification to use them again 
and they require more computations than the simple arithmetic 
mean. 



34 







Advances and Applications of DSmT for Information Fusion. Collected Works. Volume 4 



PCR5Q or PCR6 0 and normalize the result; Step 3’ 
same as Step 3. Due to space limitation, only Method 
1 is briefly illustrated in the following simple example. 

Example 2: Let’s take 0 = { A , B , C}, Shafer’s model, 
three sources mi(.), m 2(.) and rri3(.) given in next table 
and assume that their reliability factors are a.\ = 0.8, 
ct2 = 0.5, and <23 = 0.2 and their importance factors 
are fli = 0.9, fa = 0.3 and /I3 = 0.6. 





m-\ (.) 


™2(0 


m 3 ( • ) 


0 


0 


0 


0 


A 


0.8 


0.4 


0.1 


B 


0 


0.3 


0.3 


AU B 


0.1 


0.2 


0 


C 


0 


0 


0.5 


AU C 


0.1 


0 


0 


B U C 


0 


0.1 


0 


A U B U C 


0 


0 


0.1 



Table 4: Sources of evidences. 



By applying reliability followed by importance dis- 
counting, and by applying importance followed by reli- 
ability discounting, one gets: 





(•) 


m °‘2,p2 (0 


m a3 ,p 3 (•) 


0 


0.1000 


0.7000 


0.4000 


A 


00.5760 


0.0600 


0.0120 


B 


0 


0.0450 


0.0360 


aub 


0.0720 


0.0300 


0 


c 


0 


0 


0.0600 


AUC 


0.0720 


0 


0 


BUC 


0 


0.0150 


0 


AUBUC 


0.1800 


0.1500 


0.4920 



Table 5: Reliability-Importance discounting. 





(•) 


m p2,^2 (0 


rn 0 3 ,cz 3 (•) 


0 


0.0800 


0.3500 


0.0800 


A 


0.5760 


0.0600 


0.0120 


B 


0 


0.0450 


0.0360 


AUB 


0.0720 


0.0300 


0 


C 


0 


0 


0.0600 


AUC 


0.0720 


0 


0 


BUC 


0 


0.0150 


0 


AUBUC 


0.2000 


0.5000 


0.8120 



Table 6: Importance- Reliability discounting. 



The normalized results of the PCR5 0 fusion of 
m aij 3i(0 f° r i = 1,2,3 (Step 1) and PCR5 0 fusion of 
(•) for i = 1,2,3 (Step 2) is given in next Table 
7 with their arithmetic mean fhpcR s(-) (Step 3). 

7 Conclusions 

The proposition of two different discounting tech- 
niques is an important contribution to consider both 
preferences and reliability in fusion problems for deci- 
sion making purposes. In this paper, we have proposed 
a new solution for taking into account the different im- 
portances of sources of evidence in their combination. 





Step 1 


Step 2 


Step 3 




m PCR5a, a .g{-) 


rn PCR5@ g a \v 


mpcRsf) 


0 


0 


0 


0 


A 


0.5741 


0.4927 


0.5334 


B 


0.0254 


0.0244 


0.0249 


AUB 


0.0311 


0.0464 


0.0388 


C 


0.0182 


0.0182 


0.0182 


AUC 


0.0233 


0.0386 


0.0310 


BUC 


0.0032 


0.0032 


0.0032 


AUBUC 


0.3247 


0.3765 


0.3506 



Table 7: Results of Steps 1, 2 & 3. 



We have shown the clear distinction between the clas- 
sical reliability discounting technique and our new im- 
portance discounting method which can be used with 
extensions of PCR5 and PCR6 fusion rules developed in 
DSmT framework. It has been shown also that Demp- 
ster’s rule cannot be applied satisfactorily with this im- 
portance discounting approach contrariwise to PCR5 
and PCR6 rules. The importance and reliability can 
now be distinguished in the fusion of sources which in- 
troduces a link with Multi-Criteria Decision Problems 
in the fusion of sources of information. Applications 
of these techniques for risk prevention against natural 
catastrophes in mountains are under progress and re- 
sults will be published in forthcoming publications. 
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Appendix: Matlab™code listings 
for PCR5 and PCR6 

For convenience, we provide two Matlab™routines 
for PCR5 and PCR6 for the fusion of s > 2 sources 
for working with 2®, i.e. working with Shafer’s model. 
Some adaptations need to be done to work on other 
fusion spaces and to work with PCR5 q and PCR6®. 
No verification of input is done in the routines. It 
is assumed that the input matrix BBA is correct, 
both in dimension and in content. No attempt for 
fast computation, nor memory optimization is done 
in these very simple and basic codes. The deriva- 
tion of all possible combinations in the loop with 
combvec (Combinations ,vec) instruction is a very 
time-consuming task when the size of the problem 
increases and should be done once outside the routines. 
The j-th column of the BBA input matrix corresponds 
to the (vertical) bba vector associated with the 

j-th source Sj. Each element of a BBA matrix is in 
[0,1] and the sum of each column must be one. If N is 
the cardinality of the frame 0 and if S is the number 
of sources, then the size of the BBA input matrix is 
((2^) — 1)) x S. Each column of the BBA matrix 
must use the following binary encoding of elements 9 
of 2° \ {0}. For example, if 0 = {A, B,C}, then 
binary sequence 001 = A, 010 = B, Oil = AU B, . . . , 
lll = AUi3UC. These codes can be used and shared 
for free for research purposes only. Commercial uses of 
these codes, or any adaptation of them, is not allowed 
without written agreement of the author. The use of 
these codes are at the own risk of the user. 



9 Since one always considers normal input bba’s such that 
rrij (0) = 0, j = 1, . . . S, one doesn’t need to store these val- 
ues in the BBA matrix. For PCR5(j and PCRGq however, one 
needs to include as first row of BBA the nij (0) > 0 resulting 
from importance discounting of the sources and make a proper 
adaptation of indexes in the routines. 



File : PCR5fusion.m 



function [mPCR5,TotalConflict]=PCR5fusion( BBA ) 

'/, Author and copyrights : Jean Dezert 
'/, Input : BBA matrix 

'/, Output : mPCR5 = resulting bba after fusion with PCR5 
7. TotalConf lict = level of total conflict between sources 
NbrSources=size(BBA,2) ; 

CardTheta=log2 (size (BBA, 1)+1) ; 
if (NbrSources==l) 

mPCR5=BBA( : , 1) ;TotalConf lict=0; return 
end 

Card2PowerTheta=2“ (CardTheta) -1 ; 

•/. All possible combinations 
vec=[l :Card2PowerTheta] ; 

Combinations=vec ; 
for s=l :NbrSources-l 

Combinations=combvec (Combinations , vec) ; 

Combinations=Combinations’ ; 
mPCR5=zeros (Card2PowerTheta , 1) ; 

TotalConf lict=0 ; 

NbrComb=size (Combinations, 1) ; 
for c=l :NbrComb 
PC=Combinations (c , : ) ; 
mConj =zeros(l .NbrSources) ; 
for s=l : NbrSources 
mConj (s)=BBA(PC(s) ,s) ; 

massConj=prod(mConj ,2) ; 
if (massConj>0) 

'/, Check if this is a real partial conflict or not 
Intersections=PC(l) ; 
for s=2 : NbrSources 
X=PC(s) ; 

Intersections=bitand(Intersections,X) ; 

if (Intersections' 3 !)) I the intersection is not empty 
mPCR5 (Intersect ions) =mPCR5 ( Intersections) +massCon j ; 
else / the intersection is empty 
TotalConf lict=TotalConf lict +massConj ; 

I Let’s apply PCR5 rule principle 
UQ=unique (PC) ; 

Proport ions=0*UQ ; 

DenPCR5=0 ; 

for u=l :size(UQ,2) 

SameProposit ions=f ind (PC==UQ (u) ) ; 

MassProd=prod (mConj (SamePropositions) ) ; 

Proport ions(u)= MassProd*massConj ; 

DenPCR5=DenPCR5+MassProd ; 

end 

Proportions=Proportions/DenPCR5 ; 

PCR5 redistribution 
for u=l :size(UQ,2) 

mPCR5 (UQ (u) ) =mPCR5 (UQ (u) ) +Proport ions (u) ; 
end, end, end, end, return 



File : PCR6fusion.m 



function [mPCR6,TotalConflict]=PCR6fusion(BBA) 

'/, Author and copyrights : Jean Dezert 
•/. Input : BBA matrix 

'/, Output : mPCR6 = resulting bba after fusion with PCR6 
'/, TotalConf lict = level of total conflict between sources 
NbrSources=size (BBA , 2) ; 

CardTheta=log2 (size (BBA, 1)+1) ; 
if (NbrSources==l) 
mPCR6=BBA(: ,1) ; 

TotalConf lict=0 ; 

return 

end 

Card2PowerTheta=2“ (CardTheta) -1 ; 

'/, All possible combinations 
vec=[l :Card2PowerTheta] ; 

Combinations=vec ; 
for s=l :NbrSources-l 

Combinations=combvec (Combinations , vec) ; 

Combinations=Combinations’ ; 
mPCR6=zeros(Card2PowerTheta, 1) ; 

TotalConf lict=0 ; 

NbrComb=size (Combinations, 1) ; 
for c=l :NbrComb 

PC=Combinations(c, :) ; 7. particular combination 

mConj =zeros(l .NbrSources) ; 
for s=l : NbrSources 
mConj (s)=BBA(PC(s) ,s) ; 

massConj=prod (mConj ,2) ; 
if (massConj>0) 

Intersections=PC(l) ; 
f or s=2 : NbrSources 
X=PC(s) ; 

Intersections=bitand(Intersections,X) ; 

if (Intersections' 3 *)) 7. intersection not empty 
mPCR6 (Intersect ions) =mPCR6 ( Intersections) +massCon j ; 
else / empty intersection 
TotalConf lict=TotalConf lict +massConj ; 

I PCR6 rule principle 
for s=l : NbrSources 

Proportion 3 mConj (s)*(massConj/ (sum (mConj ,2))) ; 

7. Redistribution back to element PC(s) 
mPCR6 (PC ( s ) ) =mPCR6 (PC ( s ) ) +Pr oport i on ; 
end, end, end, end, return 
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Is Entropy Enough to Evaluate the Probability 
Transformation Approach of Belief Function? 

Deqiang Han 
Jean Dezert 
Chongzhao Han 
Yi Yang 



Abstract — In Dempster- Shafer Theory (DST) of ev- 
idencee and transferable belief model (TBM), the prob- 
ability transformation is necessary and crucial for 
decision-making. The evaluation of the quality of the 
probability transformation is usually based on the en- 
tropy or the probabilistic information content (PIC) 
measures, which are questioned in this paper. Another 
alternative of probability transformation approach is 
proposed based on the uncertainty minimization to ver- 
ify the rationality of the entropy or PIC as the evalua- 
tion criteria for the probability transformation. Accord- 
ing to the experimental results based on the comparisons 
among different probability transformation approaches, 
the rationality of using entropy or Probabilistic Infor- 
mation Content (PIC) measures to evaluate probability 
transformation approaches is analyzed and discussed. 

Keywords: TBM, uncertainty, pignistic probability 
transformation, evidence theory, decision-making. 

1 Introduction 

Evidence theory, as known as Dempster-Slrafer Theory 
(DST) [1,2] can reason with imperfect information in- 
cluding imprecision, uncertainty, incompleteness, etc. 
It is widely used in many fields in information fusion. 
There are also some drawbacks and problems in evi- 
dence theory, i.e. the high computational complexity, 
the counter-intuitive behaviors of Dempster’s combi- 
nation rule and the decision-making in evidence the- 
ory, etc. Several modified, refined or extended mod- 
els were proposed to resolve the problems aforemen- 
tioned, such as transferable belief model (TBM) [3] pro- 
posed by Philippe Srnets and Dezert-Smarandaclre The- 
ory (DSmT) [4] proposed by Jean Dezert and Florentin 
Smarandache, etc. 



Originally published as Han D., Dezert J., Han C., Is Entropy Enough to 
Evaluate the Probability Transformation Approach of Belief Function?, 
Fusion 2010, Edinburgh, Scotland, UK, 26-29 July 2010, and reprinted 

with permission. 

The goal of uncertainty reasoning is the decision- 
making. To take a decision, the belief assignment val- 
ues for a compound focal element should be at first 
assigned to the singletons. So the probability transfor- 
mation from belief function is crucial for the decision- 
making in evidence theory. The research on probability 
transformation has attracted more attention in recent 
years. 

The most famous probability transformation in evi- 
dence theory is the pignistic probability transformation 
(PPT) in TBM. TBM has two levels including credal 
level and pignistic level. At the credal level, beliefs 
are entertained, combined and updated while at the 
pignistic level, the PPT maps the beliefs defined on 
subsets to the probability defined on singletons, then a 
classical probabilistic decision can be made. In PPT, 
belief assignment values for a compound focal element 
are equally assigned to the singletons belonging to the 
focal element. In fact, PPT is designed according to 
principle of minimal commitment, which is somehow 
related with uncertainty maximization. But the goal 
of information fusion at decision-level is to reduce the 
uncertainty degree. That is to say more uncertainty 
might not be helpful for the decision. PPT uses equal 
weights when splitting masses of belief of partial un- 
certainties and redistributing them back to singletons 
included in them. Other researchers also proposed some 
modified probability transformation approaches [5-13] 
to assign the belief assignment values of compound fo- 
cal elements to the singletons according to some ra- 
tio constructed based on some available information. 
The typical approaches include the Sudano’s probabili- 
ties [8] and the Cuzzolin’s intersection probability [13], 
etc. In the framework of DSmT, another probability 
transformation approach was proposed, which is called 
DSrnP [9]. DSmP takes into account both the values of 
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the masses and the cardinality of focal elements in the 
proportional redistribution process. DSrnP can also be 
used in Shafer’s model within DST framework. 

In almost all the research works on probability trans- 
formations, the entropy or Probabilistic Information 
Content (PIC) criteria are used to evaluate the prob- 
ability transformation approaches. Definitely for the 
purpose of decision, less uncertainty should be better 
to make a more clear and solid decision. But does the 
probability distribution generated from belief functions 
with less uncertainty always rational or always be ben- 
efit to the decision? We do not think so. In this pa- 
per, an alternative probability transformation approach 
based on the uncertainty minimization is proposed. 
The objective function is established based on the Shan- 
non entropy and the constraints are established based 
on the given belief and plausibility functions. The ex- 
perimental results based on some provided numerical 
examples show that the probability distributions gen- 
erated based on the proposed alternative approach have 
the least uncertainty degree when compared with other 
approaches. When using the entropy or PIC to evalu- 
ate the proposed probability transformation approach, 
the probability distribution with the least uncertainty 
seemingly should be the optimal one. But some risky 
and strange results can be derived in some cases, which 
are illustrated in some numerical examples. It can be 
concluded that the entropy or PIC, i.e. the uncertainty 
degree might not be enough to evaluate the probability 
transformation approach. In another word, the entropy 
or PIC might not be used as the only criterion to make 
the evaluation. 



2 Basics of evidence theory and 

probability transformation 

2.1 Basics of evidence theory 

In Dempster-Slrafer theory [2], the elements in the 
frame of discernment (FOD) 0 are mutually exclusive. 
Define the function m : 2 e — >■ [0, 1] as the basic prob- 
ability assignment (BPA, also called mass function), 
which satisfies: 

4 ce = ° (b 

Belief function and plausibility function are defined 
respectively in (2) and (3): 



Bel (A) = A m(B) 


(2) 


pi (A) = Y m(B) 


(3) 



and Dempster’s rule of combination is defined as fol- 
lows: mi, m 2 , m n are n mass functions, the new com- 



bined evidence can be derived based on (4) 



m(A ) 



0, A = 0 

£ fl rm(Ai) 

E TI rm(Ai) 

fl l<i<n 



A^0 



(4) 



Dempster’s rule of combination is used in DST to 
accomplish the fusion of bodies of evidence. But the fi- 
nal goal of the information fusion at decision-level is to 
make the decision. The belief function (or BPA, plausi- 
bility function) should be transformed to the probabil- 
ity, before the probability-based decision-making. Al- 
though there are also some research works on making 
decision directly based on belief function or BPA [14], 
probability-based decision methods are the develop- 
ment trends of uncertainty reasoning and theories [15]. 
This is because the two-level reasoning and decision 
structure proposed by Srnets in his TBM is appealing. 

2.2 Pignistic transformation 

As a type of probability transformation approach, the 
classical pignistic probability in TBM framework was 
coined by Philippe Smets. TBM is a subjective and 
non probabilistic interpretation of evidence theory. It 
extends the evidence theory to the open-world propo- 
sitions and it has a range of tools for handling belief 
functions including discounting and conditioning, etc. 
At the credal level of TBM, beliefs are entertained, com- 
bined and updated while at the pignistic level, beliefs 
are used to make decisions by transforming beliefs to 
probability distribution based on pignistic probability 
transformation (PPT). The basic idea of the pignistic 
transformation consists in transferring the positive be- 
lief of each compound (or nonspecific) element onto the 
singletons involved in that element split by the cardi- 
nality of the proposition when working with normalized 
BPAs. 

Suppose that 0 = {6b, O 2 , ..., 0 n } is the FOD. The 
PPT for the singletons is illustrated as follows [3]: 

BetP„(# ( ) = y ^ (5) 

9,65, BC2 e 1 1 



where 2 e is the power set of the FOD. Based on the 
pignistic probability derived, the corresponding deci- 
sion can be made. 

But in fact, PPT is designed according to the idea 
being similar to uncertainty maximization. In general, 
the PPT is just a simple averaging operation. The mass 
value is not assigned discriminately to the different sin- 
gletons involved. But for information fusion, the aim is 
to reduce the degree of uncertainty and to gain a more 
consolidated and reliable decision result. The high un- 
certainty in PPT might be not helpful for the decision. 
Several researchers aim to modify the traditional PPT. 
Some typical modified probability transformation ap- 
proaches are as follows. 
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1) Sudano’s probabilities: Sudano [8] proposed 
some interesting alternatives to PPT denoted by PrPl, 
PrNPl, PraPl, PrBel and PrHyb, respectively. Sudano 
uses different kinds of mappings either proportional to 
the plausibility, to the normalized plausibility, to all 
plausibilities and to the belief, respectively or a hybrid 
mapping. 

2) Cuzzolin’s intersection probability: In the 

framework of DST, Fabio Cuzzolin [13] proposed an- 
other type of transformation. From a geometric inter- 
pretation of Dempster’s combination rule, an intersec- 
tion probability measure was proposed from the propor- 
tional repartition of the total non specific mass (TNSM) 
by each contribution of the non-specific masses involved 
in it. 

3) DSmP: Dezert and Smarandache proposed the 

DSmP as follows: Suppose that the FOD is 0 = 

{6\, the DSmP e (dj)can be directly obtained by: 



DSmP e (#j) = m({9i}) + (m({0j}) + e)- 



( E 

fiCX Vex 

|X|>2 |Y| = i 



m(X) ' 

5D m(Y)+s-\X\ - 



( 6 ) 



In DSmP, both the values of the mass assignment and 
the cardinality of focal elements are used in the pro- 
portional redistribution process. DSmP does an im- 
provement of all Sudano, Cuzzolin, and BetP formulas, 
in the sense that DSmP mathematically makes a more 
accurate redistribution of the ignorance masses to the 
singletons involved in ignorance. DSmP works in both 
theories: DST and DSmT as well. 



3 An alternative probability 
transformation based on un- 
certainty minimization 

3.1 Evaluation criteria for probability 
transformation 



The metrics depicting the strength of a critical decision 
by a specific probability distribution are introduced as 
follows: 

1) Normalized Shannon entropy 

Suppose that pg is a probability distribution, where 
9 € 0, | ©| = N and the |0| represents the cardinality 
of the FOD 0. The evaluation criterion for the proba- 
bility distribution derived based on different probability 
transformation is as follows [12]. 



Eh 



- E Pe\og 2 (p e ) 

eee 

log 2 N 



(7) 



The dividend in (7) is the Shannon entropy and the di- 
visor in (7) is maximum value of the Shannon entropy 
for {pg\9 £ 0},|0| = N. Obviously E H is normalized. 
The larger the Eh is, the larger the degree of uncer- 
tainty is. The less the Eh is, the less the degree of un- 
certainty is. When Eh= 0, there is only one hypothesis 
has a probability value of 1 and the rest has 0, the agent 
or system can make decision correctly. When Eh= 1, 
it is impossible to make a correct decision, because all 
the pg, V0 £ 0 are equal. 

2) Probabilistic Information Content 
Probabilistic Information Content (PIC) criterion is 
an essential measure in any threshold-driven automated 
decision system. A PIC value of one indicates the total 
knowledge to make a correct decision. 

PIC (P) = 1 + 1q 1 N ■ Y Po lo & (Pe) (8) 



There are still some other definitions on modified 
PPT such as the iterative and self-consistent approach 
PrScP proposed by Sudano in [5] and a modified PrScP 
in [12]. Although the approaches aforementioned are 
different, all the probability transformation approaches 
are evaluated based on the degree of uncertainty. Less 
uncertainty means that the corresponding probability 
transformation result is better. According to such a 
idea, the probability transformation approach should 
attempt to enlarge the belief differences among all the 
propositions and thus to derive a more reliable decision 
result. Is this definitely rational? Is the uncertainty 
degree always proper or enough to evaluate the prob- 
ability transformation? In the following section, some 
uncertainty measures are analyzed and an alternative 
probability transformation approach based on uncer- 
tainty minimization is proposed to verify the rationality 
of the uncertainty degree as the criteria for evaluating 
the probability transformation. 



Obviously, PIC = 1 — Eh- The PIC is the dual of 
the normalized Shannon entropy. A PIC value of zero 
indicates that the knowledge to make a correct deci- 
sion does not exist (all the hypotheses have an equal 
probability value), i.e. one has the maximal entropy. 

As referred above, for information fusion at decision- 
level, the uncertainty seemingly should be reduced as 
much as possible. The less the uncertainty in prob- 
ability measure is, the more consolidated and reliable 
decision can be made. Suppose such a viewpoint is 
always right and according to such an idea, an alter- 
native probability transformation of belief function is 
proposed. 

3.2 Probability transformation of belief 
function based on uncertainty min- 
imization 

To accomplish the probability transformation, the be- 
lief function (or the BPA, the plausibility function) 
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should be available. The relationship between the prob- 
ability and the belief function are analyzed as follows. 

Based on the viewpoint of Dempster and Shafer, the 
belief function can be considered as a lower probabil- 
ity and the plausibility can be considered as an upper 
probability. Suppose that pg G [0, 1] is a probability 
distribution, where 9 G 0. For a belief function defined 
on FOD 0, suppose that B G 2 e , the inequality (9) is 
satisfied: 



Bel(B) <^2 geB pg < Pl(B) (9) 

This inequality can be proved according to the proper- 
ties of the upper and lower probability. 

Probability distributions (pg\9 G 0) also must meet 
the usual requirements for probability distributions, i.e. 

O - v P9 - 1,V ^ 0 ( 10 ) 

It can be taken for granted that there are several proba- 
bility distributions {pg\9 G 0} consistent with the given 
belief function according to the relationships defined in 
(9) and (10). This is a multi-answer problem or one-to- 
rnany mapping relation. As referred above, the proba- 
bility is used for decision, so the uncertainty seemingly 
should be as little as possible. We can select one prob- 
ability distribution from all the consistent alternatives 
according to the uncertainty minimization criterion and 
use the corresponding probability distribution as the re- 
sult of the probability transformation. 

The Shannon entropy is used here to establish the 
objective function. The equations and inequalities in 
(9) and (10) are used to establish the constraints. The 
problem of probability transform of belief function here 
is converted to an optimization problem under con- 
straints as follows: 



, Mm {- E Pe\og 2 {p e )\ 

{pe|0e©} L eg© J 

r Bei(B)<j: 0 eB po<pi(B) (: " 

s.t. < 0 < pe < 1, V0 G 0 
I E e^ePs = 1 

Given belief function (or the BPA, the plausibility), by 
solving (11), a probability distribution can be derived, 
which has least uncertainty measured by Shannon en- 
tropy and thus is seemingly more proper to be used in 
decision procedure. 

It is clear that the problem of finding a minimum en- 
tropy probability distribution does not admit a unique 
solution in general. The optimization algorithm used 
is the Quasi-Newton followed by a global optimization 
algorithm [16] to alleviate the effect of the local ex- 
tremum problem. Other intelligent optimization algo- 
rithms [17, 18] can also be used, such as Genetic Algo- 
rithm (GA), Particle Swarm Optimization (PSO), etc. 



4 Analysis based on examples 

At first, two numerical examples are first provided to 
illustrate some probability transformation approaches. 
To make the different approaches reviewed and pro- 
posed in this paper more comparable, the examples 
in [6, 12] are directly used here. The PIC is used to 
evaluate the probability transformation. 

4.1 Example 1 

For FOD 0 = { 01 , 02 , 03 ) 04 }) the corresponding BPA 
is as follows: 

m({0i» = 0.16, m({0 2 }) = 0.14, m({0 3 » = 0.01, 

m({0 4 }) = 0.02, 

m({0i,0 2 }) = 0.20, m({0i,0 3 }) = 0.09, 
m({0i,0 4 » = 0.04, m({0 2 , 0 3 }) = 0.04, 
m({0 2 , 0 4 |) = 0.02, m({0 3 , 0 4 }) = 0.01, 
m({0i,0 2 ,0 3 » =0.10, m({0i,0 2 ,0 4 » =0.03, 
m({0i,0 3 ,0 4 » =0.03, m({0 2 ,0 3 ,0 4 » =0.03, 
m(0) = 0.08. 

The corresponding belief functions are calculated and 
listed as follows: 

Bel({9 1 }) = 0.16, Bel{{6 2 }) = 0.14, 

Bel\{9 3 }) = 0.01, Bel{{6 4 }) = 0.02, 

Bel{{6 1 ,6 2 }) = 0.50, Bel({6 4 , 0 3 }) = 0.26, 

Bel({0 u 0 4 }) = 0.22, PeZ({0 2 , 0 3 }) = 0.19, 

PeZ({0 2 , 0 4 }) = 0.18, Bel({6 3 , 0 4 }) = 0.04, 

Bel({9 u 0 2 , 0 3 }) = 0.74, Bel({6 1 , 0 2 , 0 4 }) = 0.61, 
Bel({9 1 ,9 3l 0 4 }) = 0.36, Bel{{9 2 ,9 3 , 0 4 }) = 0.27, 
Bel(Q) = 1.00. 

The corresponding plausibility functions are calcu- 
lated and listed as follows: 

PZ({0 4 }) = 0.73, Pl({d 2 }) = 0.64, PZ({0 3 }) = 0.39, 
Pl({9i}) = 0.26, 

PZ({0i, 0 2 }) = 0.96, Pi({0i, 0 3 }) = 0.82, 

PZ({0r, 0 4 }) = 0.81, PI ({0 2 , 0 3 }) = 0.78, 

PZ({0 2 , 0 4 }) = 0.74, PZ({0 3 , 0 4 }) = 0.50, 

-PZ({0i,0 2 , 0 3 }) = 0.98, PZ({0i, 0 2 , 0 4 }) = 0.99, 
-PZ({0i,0 3 , 0 4 }) = 0.86, PZ({0 2 , 0 3 , 0 4 }) = 0.84, 

PZ(0) = 1.00. 

Suppose the probability distribution as the unknown 
variables. Based on the plausibility functions and the 
belief functions, the constraints and the objective func- 
tion can be established according to (11). The probabil- 
ity distribution can be derived based on the minimiza- 
tion. The results of some other probability transforma- 
tion approaches are also calculated. All the results are 
listed in Table 1 (on the next page) to make the com- 
parison between the approach proposed in this paper 
(denoted by Un_min) and other available approaches. 

4.2 Example 2 

For FOD 0 = {0i, 0 2 , 0 3 ,0 4 }, the corresponding BBA 
is as follows: 

m({0 1 }) = 0.05, m({0 2 }) = 0.00, m({0 3 }) = 0.00, 

m({0 4 }) = 0.00, 
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Table 1 Probability Transformation Results of 
Example 1 based on Different Approaches 





01 


02 


03 


04 


PIC 


BetP [3] 


0.3983 


0.3433 


0.1533 


0.1050 


0.0926 


PraPl [8] 


0.4021 


0.3523 


0.1394 


0.1062 


0.1007 


PrPl [8] 


0.4544 


0.3609 


0.1176 


0.0671 


0.1638 


PrHyb [8] 


0.4749 


0.3749 


0.0904 


0.0598 


0.2014 


PrBcl [8] 


0.5176 


0.4051 


0.0303 


0.0470 


0.3100 


FPT[11] 


0.5176 


0.4051 


0.0303 


0.0470 


0.3100 


DSmP_0[9] 


0.5176 


0.4051 


0.0303 


0.0470 


0.3100 


PrScP [10] 


0.5403 


0.3883 


0.0316 


0.0393 


0.3247 


PrBPl [12] 


0.5419 


0.3998 


0.0243 


0.0340 


0.3480 


PrBP2 [12] 


0.5578 


0.3842 


0.0226 


0.0353 


0.3529 


PrBP3 [12] 


0.0605 


0.3391 


0.0255 


0.0309 


0.3710 


Un_min 


0.7300 


0.2300 


0.0100 


0.0300 


0.4813 



m({ 6 1 , 0 2 }) = 0.39, m({0 1; 0 3 }) = 0.19, 
m({0 i, 0 4 }) = 0.18, m({0 2 , 0 3 }) = 0.04, 
m{{9 2 , 0 4 }) = 0.02, m({0 3 , 0 4 }) = 0.01, 
m({0 1 ,0 2 ,e 3 }) = 0.04, m({9 i,0 2 ,0 4 }) = 0.02, 
m({0!, 0 3 ,0 4 }) = o.o3, m{{e 2 ,e 3 ,e 4 }) = 0 . 03 , 
m(G) = 0.00. 

The corresponding belief functions are calculated and 
listed as follows: 

Bel({ 6 1 }) = 0.05, Bel({9 2 }) = 0.00, 

Bel({9 3 }) = 0.00, Bel({9 4 }) = 0.00, 

Bel{{9 1 , 0 2 }) = 0.44, Bel{{9 4 ,9 3 }) = 0.24, 
Bel{{9 ll 9 4 }) = 0.23, Bel{{9 2 ,9 3 }) = 0.04, 

Bel{{9 2 , 0 4 }) = 0.02, Bel{{9 3l 0 4 }) = 0.01, 

Bel({8 u 9 2 , 0 3 }) = 0.71, Bel({9 1 ,9 2 , 0 4 }) = 0.66, 
Bel{{9 1 , 0 3 , 0 4 }) = 0.46, Bel({9 2 , 0 3 , 9 4 }) = 0.10, 
Bel(<d) = 1.00. 

The corresponding plausibility functions are calcu- 
lated and listed as follows: 

Pl({9 4 }) = 0.90, Pl{{9 2 }) = 0.54, Pl({9 3 }) = 0.34, 
Pl{{0 4 }) = 0.29, 

Pl({9i,9 2 }) = 0.99, Pl({9 4 ,9 3 }) = 0.98, 

Pl{{9^9 4 }) = 0.96, Pl({9 2 ,9 3 }) = 0.77, 

Pl{{9 2 , 0 4 }) = 0.76, Pl({9 3 , 9 4 }) = 0.56, 
Pl({9 4l 0 2 ,9 3 }) = 1.00, Pl({9 4 ,9 2 ,9 4 }) = 1.00, 
Pl({0i,9 3 ,9 4 }) = 1.00, Pl({9 2 ,9 3 ,9 4 }) = 0.95, 

Pl{G) = 1.00. 

Suppose the probability distribution as the unknown 
variables. Based on the plausibility functions and the 
belief functions, the constraints and the objective func- 
tion can be established according to (11). The probabil- 
ity distribution can be derived based on the minimiza- 
tion. The results of some other probability transfor- 
mation approaches are also calculated. All the results 
are listed in Table 2 to make the comparison between 
the approach proposed in this paper and other available 
approaches. 

N/A in Table 2 means ’’Not available”. DSmP_0 
means the parameter e in DSmP is 0. 



Table 2 Probability Transformation Results of 
Example 2 based on Different Approaches 





0i 


02 


03 


04 


PIC 


PrBel [8] 


N/A due to 0 value of singletons 


FPT[11] 


N/A due to 0 value of singletons 


PrScP [10] 


N/A due to 0 value of singletons 


PrBPl [12] 


N/A due to 0 value of singletons 


PraPl [8] 


0.4630 


0.2478 


0.1561 


0.1331 


0.0907 


BetP [3] 


0.4600 


0.2550 


0.1533 


0.1317 


0.0910 


PrPl [8] 


0.6161 


0.2160 


0.0960 


0.0719 


0.2471 


PrBP2 [12] 


0.6255 


0.2109 


0.0936 


0.0700 


0.2572 


PrHyb [8] 


0.6368 


0.2047 


0.0909 


0.0677 


0.2698 


DSmP_0[9] 


0.5162 


0.4043 


0.0319 


0.0477 


0.3058 


PrBP3 [12] 


0.8823 


0.0830 


0.0233 


0.0114 


0.5449 


Un_min 


0.9000 


0.0900 


0.0000 


0.0100 


0.7420 



Based on the experimental results listed in Table 1 
and Table 2, it can be concluded that the probabil- 
ity derived based on the proposed approach (denoted 
by Un_min) has significantly lower uncertainty when 
compared with the other probability transformation ap- 
proaches. The difference among all the propositions can 
be further enlarged, which is seemingly helpful for the 
more consolidated and reliable decision. 

Important remark: In fact, there exist fatal deficien- 
cies in the probability transformation based uncertainty 
minimization, which are illustrated in following exam- 
ples. 

4.3 Example 3 

The FOD and BPA are as follows [4]: 
e = {9 1 ,9 2 }, m{{9\}) = 0.3, 

TO ({0 2 }) = O.l, ,m({0i,0 2 }) = 0.6 
Based on different approaches, the experimental re- 
sults are derived as listed in Table 3 



Table 3 Probability Transformation Results of 
Example 3 based on Different Approaches 





01 


02 


PIC 


BetP 


0.6000 


0.4000 


0.0291 


PrPl 


0.6375 


0.3625 


0.0553 


PraPl 


0.6375 


0.3625 


0.0553 


PrHyb 


0.6825 


0.3175 


0.0984 


DSmP _0. 001 


0.7492 


0.2508 


0.1875 


PrBcl 


0.7500 


0.2500 


0.1887 


DSmP_0 


0.7500 


0.2500 


0.1887 


Ummin 


0.9000 


0.1000 


0.5310 



DSmP_0 means the parameter e in DSmP is 0 and 
DSmP_0.001 means the parameter e in DSmP is 0.001. 

Is the probability transformation based on PIC max- 
imization (i.e. entropy minimization) rational ? 

It can be observed, in our very simple example 
3, that all the mass of belief 0.6 committed {9 4 ,9 2 } 
is actually redistributed only to the singleton {0i} 
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using the Unmrin transformation in order to get the 
maximum of PIC. 



A deeper analysis shows that with Un_min transfor- 
mation, the mass of belief m{9 1 , 9 2 } > 0 is always fully 
distributed back to {9 1 } as soon as m{{9 1 }) > m({9 2 }) 
in order to obtain the maximum of PIC (i.e. the min- 
imum of entropy). Even in very particular situations 
where the difference between masses of singletons is 
very small like in the following example: 

© = {e 1 ,e 2 }, m({0i}) = o.ioooooi, 

?n({ 6 > 2 }) = 0.1, m({ 6 >i, 6 > 2 }) = 0.7999999. 

This previous modified example shows that the prob- 
ability obtained from the minimum entropy principle 
yields a counter-intuitive result, because m{{9\}) is 
almost the same as m({ 0 2 }) and so there is no solid rea- 
son to obtain a very high probability for 9\ and a small 
probability for 9 2 . Therefore, the decision based on the 
result derived from Unmrin transformation is too risky. 
Sometimes uncertainty can be useful, and sometimes it 
is better to not take a decision than to take the wrong 
decision. So the criterion of uncertainty minimization 
is not sufficient for evaluating the quality /efficiency of a 
probability transformation. There are also other prob- 
lems in the probability transformation based on uncer- 
tainty minimization principle, which are illustrated in 
our next example. 

4.4 Example 4 

The FOD and BPA are as follows: 0 = {9\, 9 2 , 6 * 3 }, 
with , 

m({9 1 ,9 2 }) = m({9 2 ,9 3 }) = m{{9 ll 9 3 }) = 1/3. 



Using the probability transformation based on uncer- 
tainty minimization, we can derive six different proba- 
bility distributions yielding the same minimal entropy, 



which are listed as follows: 

P({0 1 }) = l/3, P{{9 2 }) 
P({9 1 }) = 1/3, P{{9 2 }) 
P({9i }) = 0, P({9 2 }) 

P({9i}) = 0 , P{{9 2 }) 

P({9i}) = 2A P({9 2 }) 
P({9 1 }) = 2/3, P{{9 2 }) 



= 2/3, P({0 3 })=O; 

= 0, P({9 3 }) = 2/3; 

= 1/3, P({9 3 }) = 2/3; 
= 2/3, P({ 6 > 3 }) = 1/3; 
= 1/3, P({9 3 }) = 0; 

= 0 , P({0 3 }) = 1/3. 



It is clear that the problem of finding a probabil- 
ity distribution with minimal entropy does not admit a 
unique solution in general. So if we use the probabil- 
ity transformation based on uncertainty minimization, 
there might exist several probability distributions de- 
rived as illustrated in this Example 4. How to choose 
a unique one? In Example 4, depending on the choice 
of the admissible probability distribution, the decision 
results derived are totally different which is a serious 
problem for decision-making support. 



From our analysis, it can be concluded that the max- 
imization of PIC criteria (or equivalently the minimiza- 
tion of Shannon entropy) is not sufficient for evaluating 
the quality of a probability transformation and other 
criteria have to be found to give more acceptable prob- 
ability distribution from belief functions. The search 
for new criteria for developing new transformations is 
a very open and challenging problem. Until finding 
new better probability transformation, we suggest to 
use DSrnP as one of the most useful probability trans- 
formation. Based on the experimental results shown in 
Examples 1-3, we see that the DSmP can always be 
computed and generate a probability distribution with 
less uncertainty and it is also not too risky, i.e. DSmP 
can achieve a better tradeoff between a high PIC value 
(i.e. low uncertainty) and the risk in decision-making. 



5 Conclusion 

Probability transformation of belief function can be 
considered as a probabilistic approximation of belief 
assignment, which aims to gain more reliable decision 
results. In this paper, we focus on the evaluation crite- 
ria of the probability transformation function. Experi- 
mental results based on numerical examples show that 
the maximization of PIC criteria proposed by Sudano 
is insufficient for evaluating the quality of a probability 
transformation. More rational criteria have to be found 
and to better justify the use of a probability transfor- 
mation with respect to another one. 

All the current probability transformations devel- 
oped so far redistribute the mass of partial ignorances 
to the belief of singletons included in it. The redistri- 
bution is based either only on the cardinality of partial 
ignorances, or eventually also on a proportionalization 
using the masses of singletons involved in partial igno- 
rances. However when the mass of a singleton involved 
in a partial ignorance is zero, some probability trans- 
formations, like Cuzzolin’s transformation by example, 
do not work at all and that’s why the e parameter has 
been introduced in DSmP transformation to make it 
working in all cases. In future, we plan to develop 
a more comprehensive and rational criterion, which 
can take both the risk and the uncertainty degree into 
consideration, to evaluate the quality of a probability 
transformation and to find an optimal probability 
distribution from any basic belief assignment. 
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Algebraic Generalization of Venn Diagram 

Florentin Smarandache 



Abstract. 

It is easy to deal with a Venn Diagram for 1 < n < 3 sets. When n gets larger, the picture 
becomes more complicated, that's why we thought at the following codification. That’s 
why we propose an easy and systematic algebraic way of dealing with the representation 
of intersections and unions of many sets. 

Introduction. 

Let's first consider 1 < n < 9, and the sets Si, S 2 , . . ., S n . 

Then one gets 2 n -l disjoint parts resulted from the intersections of these n sets. Each part 
is encoded with decimal positive integers specifying only the sets it belongs to. Thus: 
part 1 means the part that belongs to Si (set 1) only, part 2 means the part that belongs to 
S 2 only, ..., part n means the part that belongs to set S n only. 

Similarly, part 12 means that part which belongs to Si and S 2 only, i.e. to SiflS 2 only. 

Also, for example part 1237 means the part that belongs to the sets Si, S 2 , S 3 , and S 7 only, 
i.e. to the intersection SinS 2 nS 3 nS 7 only. And so on. This will help to the construction 
of a base formed by all these disjoint parts, and implementation in a computer program of 
each set from the power set P(S 1 S 2 ... S n ) using a binary number. 

The sets Si, S 2 , ..., S n , are intersected in all possible ways in a Venn diagram. Let 1 < k < 
n be an integer. Let’s denote by: iih. . .ik the Venn diagram region/part that belongs to the 
sets Sn and Si 2 and . . . and Sik only^ forall lHind all n. The part which is outside of all sets 
(i.e. the complement of the union of all sets) is noted by 0 (zero). Each Venn diagram will 
have 2 n disjoint parts, and each such disjoint part (except the above part 0) will be formed 
by combinations of k numbers from the numbers: 1, 2, 3, ..., n. 

Example. 

Let see an example for n = 3 , and the sets Si, S2, and S3. 




Fig. 1. 
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Unions and Intersections of Sets. 

This codification is user friendly in algebraically doing unions and intersections in a simple 
way. 

Union of sets Sa, Sb, Sv is formed by all disjoint parts that have in their index either the 
number a, or the number b, . . or the number v. 

While intersection of Sa, Sb, Sv is formed by all disjoint parts that have in their index all 
numbers a, b, v. 

For n = 3 and the above diagram: 

S 1 US 23 = {1, 12, 13, 23, 123}, i.e. all disjoint parts that include in their indexes either the 
digit 1, or the digits 23; 

and SinS 2 = { 12, 123}, i.e. all disjoint parts that have in their index the digits 12. 

Remarks. 

When n > 10, one uses one space in between numbers: for example, if we want to represent 
the disjoint part which is the intersection of S3, S 10 , and S 27 only, we use the notation [3 10 
27), with blanks in between the set indexes. 

Depending on preferences, one can use other character different from the blank in 
between numbers, or one can use the numeration system in base n+1, so each 
number/index will be represented by a unique character. 
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Importance of Sources using the Repeated Fusion 
Method and the Proportional Conflict Redistribution 

Rules #5 and #6 



Florentin Smarandache 
Jean Dezert 



Originally published as a scientific note 2010 in HAL archive, https:// 
hal.archives-ouvertes.fr/hal-00471839. Printed with permission. 



Abstract. 

We present in this paper some examples of how to compute by hand the PCR5 fusion rule for 
three sources, so the reader will better understand its mechanism. 

We also take into consideration the importance of sources , which is different from the classical 
discounting of sources . 

1. Introduction. 

Discounting of Sources . 

Discounting a source mi() with the coefficient 0 < a < 1 and a source 1112(4 with a coefficient 
0 <f3 < 1 (because we are not very confident in them), means to adjust them to mi ’(.) and in 2 ’(.) 
such that: 

mj ’(A) = a -mi (A) for A 4 0 (total ignorance), and in / ’( 0 ) = a mi(Q)+ 1-a, 
and m2 ’(A) = (> ■1112(A) for A 4 0 (total ignorance), and in 2 '( 0 ) = f( ■ni2( 0 )+ 1- [3. 

Importance of Sources using Repeated Fusion . 

But if a source is more important than another one (since a such source comes from a more 
important person with a decision power, let’s say an executive director), for example if source 
m2(.) is twice more important than source in /(.), then we can combine mi(.) with m 2 (.) and with 
ni2(), so we repeated m2() twice. Doing this procedure, the source which is repeated (combined) 
more times than another source attracts the result towards its masses - see an example below. 
Jean Dezert has criticized this method since if a source is repeated say 4 times and other source is 
repeated 6 times, then combining 4 times ni/(.) with 6 times m 2 (.) will give a result different from 
combining 2 times mi(.) with 3 times although 4/6 = 2/3. In order to avoid this, we take 
the simplified fraction n/p, where gcd(n, p) =1, where gcd is the greatest common divisor of the 
natural numbers n and p. 

This method is still controversial since after a large number of combining n times mi() with p 
times in 2/.) for n+p sufficiently large, the result is not much different from a previous one which 
combines «/ times m/(.) with pi times /«?(.) for ni+pi sufficiently large but a little less than n+p, 
so the method is not well responding for large numbers. 
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A more efficacy method of importance of sources consists in taking into consideration the 
discounting on the empty set and then the normalization (see especially paper [1] and also[2]). 

2. Using m PCR5 for 3 Sources. 

Example calculated by hand for combining three sources using PCR5 fusion rule. 

Let’s say that m 2 (.) is 2 times more important than /«,(.); therefore we fusion mi(), 
m 2 (.), m 2 (). 





A 


B 


AUB 


Af|B=0 


m 1 


0.1 


0.7 


0.2 




m 2 


0.4 


0.1 


0.5 




m 2 


0.4 


0.1 


0.5 




m ni 


0.193 


0.274 


0.050 


0.483 



X L4 _ yiB _ Z 2AUB _ 0-005 _ 0-05 
0.1 0.7 0.5 0.7 7 

x 1A =0.000714 
y iB =0.000714 
z 2aub =0.003572 

_ yiB _ Z 24Ul _ 0- 14 _ Q- 07 _ 0- 7 

0.4 0.7 0.5 1.6 0.8 8 

x 2A = 0.035000 
y 2B =0.061250 
z 2aub =0.043750 

x 3a _ y^B _ z 3a\jb _ Q-008 _ Q-08 
0.4 0.1 0.2 0.7 7 

x 3 ^ =0.004571 
y 3B =0.001143 
z 3AilB = 0.002286 

^ _ y 4 b _ Z 4AUB _ (0.4X0. 1)(0.2) 0.008 0.08 

0.4 0.1 0.2 0.7 0.7 7 

x 4A =0.004571 
y 4B = 0.001143 

z 4aub = 0.002286 
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X 5A _ y 5B _ Z 5A{JB _ 0-14 _ 1-4 

0.4 ~ 0.7 ~ 0.5 ~~ 1.6 _ 16 
x 5A = 0.035000 
y 5B = 0.061250 

z 5^uit = °- 043750 



■*6 A 

0.1 



y^B _ Z 6AUB 

0.1 _ 0.5 



x 6 ^ =0.000714 
y 6B — 0-000714 



Z 6AUB = 0.003572 



0005 005 

0.7 ~~ 7 



x 7 , _ y 7B (0.1)(0.1)(0.1) _ 0.001 
0.1 (0.1)(0.1) 0.1 + 0.01 0.11 
x 6A = 0.000909 
y 6B =0.000091 



^ _ y*B _ (0.4)(0.7)(0.1) _ 0.028 2.8 

0.4 (0.7)(0.1) 0.1 + 0.01 0.47 47 

x SA = 0.023830 
y SB =0.004170 



Xg A = x SA = 0.023830 
y 9B =y ss = 0.004170 

*t(M _ y l0B _ (0.1)(0.4)(0.1) _ 0.004 0.4 0.2 

(0.1X0.4) 0.1 0.04 + 0.1 0.14 14 7 

x l0A = 0.001143 
y Sfi = 0.002857 



X UA = X WA =0.001143 
yiiB = yi0B = 0.002857 



49 




Advances and Applications of DSmT for Information Fusion. Collected Works. Volume 4 



*i 2 ^ ... . Jim _ (0.4)(0.4)(0.7) 0.112 11.2 

(0.1)(0.4) 0.1 0.16 + 0.7 0.86 86 

x l2A = 0.020837 
y\iB — 0-091163 

A B AUB 

< 2 f 5 0.345262 0.505522 0.149216 

If we didn’t double m 2 (.) in the fusion rule, we’d get a different result. 
Let’s suppose we only fusion mi() with m 2 (): 





A 


B 


AUB 


Af|B=® 




0.1 


0.7 


0.2 




m 2 


0.4 


0.1 


0.5 




m l2 


0.17 


0.44 


0.10 


0.29 


,„ PCR5 

m x2 


0.322 


0.668 


0.100 


0 



And now we compare the fusion results: 

A B AUB 

ni['yA 0.345 0.506 0. 1 49 - three sources (sec ond - source - doubled ); importance of sources considered; 
0.322 0.668 0. 100 - two sources; importance of sources not considered. 

The more times we repeat m 2 (.) the closer tn[ 2 R 2 5 (A) A m 2 (A)=0.4, m[ 2 R 2 (B) A m2(B)=0.1, and 

m™ 5 (A uBj9 m 2 (A u B )=0.5. Therefore, doubling, tripling, etc. a source, the mass of each 

element in the frame of discernment tends towards the mass value of that element in the repeated 
source (since that source is considered to have more importance than the others). 

For the readers who want to do the previous calculation with a computer, here it is the m PCR5 

Formula for 3 Sources: 

m PCR5 (-^) — m i23 

X,YgG & 
a*x*y*a 
Ar\xnY=<s> 

m l (T)/u 2 [Ay m 2 (A) m l (A)m 2 (T)m 3 (A)~ 

(r) + m 2 M + m 3 (A W + m 2 (?) + m 3 w 




m I (A)An 2 (X)m 3 (F) | 
m 1 (A) + m 2 (x) + m 3 (Y) 
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+ Z 

.YeG 0 

Anx=t> 

+ I 

XeG 0 

Anx=® 



m x (A^ /n 2 (X)m 3 (X) m 1 (X)m 2 (^4) m 3 (X) (X)m 2 (X)m 3 (yf) 

M + /n 2 + m 3 M + m 2 M + m 3 M + /;z 2 + m 3 A) 

(^) 2 m 2 (^) 2 m 3 (X) m, (X)m 2 (yf)~ m 3 (yf)~ m, (yf)“ m 2 (X)m 3 (^) 2 
/«, ( A) + m 2 (yf) + m 3 (X) /«, (X) + m 2 (yf) + m 3 (yf) /«, ( /t ) + m 2 (X) + m 3 (^4) 



+ 



n c - -i , . Formula for 3 Sources: 

3. Similarly, let’s see the PCR6 



m pcR6^^ m m 

^,7eG 0 

A*X*Y*A 

AftXC\Y=® 



m^A) 2 m 2 (X)m 3 (Y) 
m t (A) + m 2 (X) + m 3 (Y) 



m l (Y)m 2 (yt)“ m 3 (X) /«, (X)/« 2 (7)m 3 (yf) 



2 A 



m x (7) + m 2 (yt) + m 3 (X) /«! (X) + /n 2 ( 7 ) + m 3 (A'j 



- Z 

XgG 0 

ac iaxd 



^ /^(yf) m 2 (Xjm 3 (Xj m 1 (X)m 2 (A) m 3 (X) (X)m 2 (X)m 3 (yf) ^ 



m 



3 (y4) + m 2 (X) + m 3 (X) m x (X) + /« 2 (yf) + m 3 (X) /;z, (X) + m 2 (X) + m, (yf) 



+ 



I 



m, (yt)“ m 2 (yt)m 3 (X) + m, (yf)m 2 (yf) m 3 (X) 
n\ A) + m 2 A) + m 3 (X) 



+ 



XgG . 

^rw=<t> v 

m, (X)m 2 (yt ) 2 m 3 (yt) + /«, (X)m 2 (yt)m 3 (yf)~ 



m l (x) + m 2 A) + m 3 M 
W, (yf)~ m 2 (x)m 3 (yf) + /«J (yt)m 2 (x)m 3 (yf) 
in , M + m 2 (2-) + m 3 M 



2 A 



4. A General Formula for PCR6 f or .s - > 2 Sources. 



5-1 



m PCR6 ( A ) = m \2...s + z ,z z I m. (yt) +m i2 (yt) + ... + m h (A) 



X i ,X 2 ,...,X s _ 1 <eG 0 k = 1 (; 1 ,! 2 ,...,( s )eP(l,2,...,i) 
2f,^.iE{l,2,....s-l} 



Qx,.jn.4=® 

m. ( A)m h (A)...m i ( A)m iM (X, ).../«, (X,_ t ) 
m. (A) + m i2 (A) + .. . + m t (A) + (X t ) + . . . + m, (X s _ k ) 



where P(l, 2, s) is the set of all permutations of the elements {1,2, ..., s}. 
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It should be observed that Xj, X 2 , X s .j may be different from each other, or some of them 
equal and others different, etc. 

We wrote this PCR6 general formula in the style of PCR5, different from Amaud Martin & 
Christophe Oswald’s notations, but actually doing the same thing. In order not to complicate the 
formula of PCR6, we did not use more summations or products after the third Sigma. 

As a particular case: 



where P( 1,2,3) is the set of permutations of the elements {1,2,3} . 

It should also be observed that X 1 may be different from or equal to X 2 . 

Conclusion. 

The aim of this paper was to show how to manually compute PCR5 for 3 sources on some 
examples, thus better understanding its essence. And also how to take into consideration the 
importance of sources doing the Repeated Fusion Method . We did not present the Method of 
Discounting to the Empty Set in order to emphasize the importance of sources, which is better 
than the first one, since the second method was the main topic of paper [2] . 

We also presented the PCR5 formula for 3 sources (a particular case when n=3 ), and the general 
formula for PCR6 in a different way but yet equivalent to Martin-Os wald’s PCR6 formula. 
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m PCR6 (A) = 

m i23 + z z z 

v v ^rz® k-\ (i. ;_\e=P 1 



2 




X„X,e G® k = 1 ((.A, ; 3 )eP(1,2, 3) 

x\*a,x 2 *a 

jr I rw 1 n u=® 



m h (A) + . . . + m it (A) + m, M (X x ) + . .. + ra. (X 2 ) 
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Abstract 

This paper proposes a new solution for reducing the number of sources 
of evidence to be combined in order to diminish the complexity of the fusion 
process required in some applications where the real-time constraint and 
strong computing resource limitation are of prime importance. The basic 
idea consists in selecting, among the whole set of sources of evidence, only the 
biggest subset of sources which are not too contradicting based on a criterion 
of Evidence Supporting Measure of Similarity (ESMS) in order to process 
solely the coherent information received. The ESMS criterion serves actually 
as a generic tool for outlier source identification and rejection. Since the 
ESMS between several belief functions can be defined using several distance 
measures, we browse the most common ones in this paper and we describe 
in detail the principle of our Generalized Fusion Machine (GFM). The last 
part of the paper shows the improvement of the performances of this new 
approach with respect to the classical one in a real-data based and real-time 
experiment for robot perception using sonar sensors. 

Key words: 

Information fusion; Belief function; Complexity reduction; Robot 
perception; DSmT; Measure of similarity; Distance; Lattice. 
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1. Introduction 

Information fusion (IF) has gained more and more interest in the scien- 
tific community since the end of nineties because of the development of so- 
phisticated multisensor and hybrid (involving human feedbacks in the loop) 
systems in many fields of applications (robotics, defense, security, medicine, 
etc.). IF appears through many scientific international conferences and work- 
shops [11]. The main theories useful for information fusion are the Proba- 
bility theory [16, 25] (and more recently the Imprecise Probability Theory 
[38]), the Possibility Theory [8] (based on Fuzzy Sets theory [42]), Neutro- 
sophic Set Theory [15] and belief function theories, mainly Dempster-Shafer 
theory (DST) [29] and more recently Dezert-Smarandaehe theory (DSmT) 
[31, 32, 33], 

In this work, we concentrate our attention on belief functions theories and 
specially on DSmT because of its ability to deal efficiently with uncertain, 
imprecise and conflicting quantitative and qualitative information. Basically, 
in DST, a basic belief assignment (bba) m{.) is a mapping from the power 
set 2 e (see section 2.2 for details) of the frame of discernment 0 into [0, 1] 
such that 

m(0) = 0 and ^ m( X) = 1. 

.Ye 2© 

In DST, 0 represents the set of exclusive and exhaustive possibilities for the 
solution of the problem under consideration. In DSmT, 0 can be a set of 
possible non exclusive elements and the definition of bba is extended to the 
lattice structures of hyper-power set D e , and to super-power set S e in UFT 
(Unification of Fusion Theories) [30, 32], Chap. 8 - see also section 2.3 for a 
brief presentation and [7, 10] and [33] for definitions, details and examples. 
In general m(.) is not a measure of probability, except in the case when its 
focal elements (i.e. the elements which have a strictly positive mass of be- 
lief) are singletons; in such case, m(.) is called a Bayesian bba [29] which 
can be considered as a subjective probability measure. In belief function 
theories, the main information fusion problem consists in finding an efficient 
way for combining several sources of evidence sq, S 2 , . .., s n characterized 
by their bba’s mi(.), m 2 (.), •••> m n(-) assumed for simplicity here defined 
on the same fusion space, either 2®, D & , or S e depending on the underlying 
model associated with the nature of the frame 0. The difficulty in infor- 
mation fusion arises from the fact that the sources can be conflicting (i.e. 
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one source commits some belief in a proposition A whereas another source 
commits some belief in a proposition B but A and B are known to be truly 
exclusive {Ad B = 0)) and one needs a solution for dealing with conflict- 
ing information in the fusion process. In DST, Shafer proposes Dempster’s 
rule of combination as the fusion operator for combining sources of evidence 
whereas in DSmT the recommended fusion operator is the PCR5 (Propor- 
tional Conflict redistribution rule # 5) rule of combination, see [29] and [33] 
for discussions and comparisons of these rules. PCR5 is more complex than 
Demspter’s rule but it offers a better ability to deal with conflicting infor- 
mation. 

Both rules however become intractable in some applications having only 
low computational capacities (as in some autonomous onboard systems by 
example) because their complexity increases drastically with the number n 
of sources to combine and/or with the size of the frame 0, specially in the 
worst case (i.e. when a strict positive mass of belief is committed to all ele- 
ments of the fusion space). To circumvent this problem, one has to play on 
both sides: 1) reducing the number of sources to combine and 2) reducing 
the size of the frame 0. In this paper, we propose a solution only for re- 
ducing the number of sources to combine because we are not concerned in 
our application of robot perception by the second aspect since in this ap- 
plication our frame 0 has only two elements representing the emptiness or 
occupancy states of the grid cells of the perceived map of the environment. 
To expect good performances of such limited-resource fusion scheme, it seems 
natural to search and combine altogether only the sources which are coher- 
ent (which are not too conflicting) according to a given measure of similarity. 

Such idea has been already investigated by several authors who have 
proposed some distance measures between two evidential sources in different 
fields of applications. For example, Tessem [35] in 1993 proposed the distance 
dij = maxe l£ Q\BetPi(6i) — BetPj[6i ) |) according to the pignistic probability 
transform BetP(.). In 1997, Bauer [1] introduced two other measures of error 
to take a decision based on pignistic probability distribution after approxi- 
mation. In 1998, Zouhal and Denoeux [43] also introduced a distance based 
on mean square error between pignistic probability. In 1999, Petit-Renaud 
[26] has defined a measure directly on the power set of 0 and proposed an 
error criterion between two belief structures based on the generalized Haus- 
dorff distance. In 2001, Jousselme et al. [14] proposed in DST framework 
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a new distance measure dij = 1 — + mf — 2 (mi, m 2 ) between two 

basic belief assignments (bba’s) for measuring their similarity (closeness). 
In 2006, Ristic and Smets [27, 28] have defined in the TBM (Transferable 
Belief Model) framework a TBM-distance between bba’s to solve the associ- 
ation of uncertain combat ID d eclarations. These authors recall also the 
Bhattacharya distance d tJ = 1 — between 

two bba’s. In 2006 also, Diaz et al. [6] proposed a new measure of simi- 
larity between bba’s based on Tversky’s similarity measure [37]. Note that 
in belief function theories, the direct use of classical measures used in Proba- 
bility theory (say like Kullback Leibler (KL) distance [3]) cannot be applied 
directly because bba’s are not probability measures in general. 

In this paper, we develop an Evidence Support Measure of Similarity 
(ESMS) in a generalized fusion space according to different lattices [7, 10] 
for reducing the number of sources of evidence to combine and thus reducing 
the complexity of the computational burden. As shown in the next sections, 
we propose several possible measures of distance for ESMS and we compare 
their performances in our specific application of mobile robot perception. 
The purpose of this paper is not to select, nor to justify, the best measure 
of distance for ESMS but only to show the practical advantage of using the 
ESMS criteria as a generic tool for reducing the complexity of the fusion with 
keeping good performances for our application. 

This paper is organized as follows. In section 2, we briefly recall the 
main paradigms for dealing with uncertain information. In section 3, we 
give a general mathematical definition of ESMS between two basic belief 
assignments and we establish some basic properties of ESMS. In section 4, 
we extend and present different possible ESMS functions (distance measures) 
fitting with the different mathematical paradigms listed in section 2. A 
comparison of the performances of five possible distances is made through 
a simple example in section 5. The simulation presented in section 6 shows 
in details how ESMS filter is used within GFM scheme. An application 
of ESMS filter in GFM for mobile robot perception with real-data (sonar 
sensors measurements) and in real-time is presented in section 7 to show the 
advantages of the approach proposed here. The conclusion is given in section 
8 . 
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2. The main paradigms for dealing with uncertainties 



2.1. Probability Theory and Bayes’ rule. 

The (axiomatic) Probability Theory [16] is the most achieved theory for 
dealing with randomness. We will not present this theory in details since 
there exist dozens of very good classical books devoted to it, see for exam- 
ple [25]. We just recall that a random experiment is an experiment (action) 
whose result is uncertain before it is performed and a trial is a single per- 
formance of the random experiment. An outcome is the result of a trial and 
the sample space 0 is the set of all possible outcome of the random experi- 
ment. An event is the subset of the sample space 0 to which a probability 
measure can be assigned. Two events A* and Aj are said exclusive (disjoint) 
if Ai fi Aj = 0, Vi 7 ^ j, where the empty set 0 represents the impossible 
event. The sure event is the sample space 0. The probability theory is based 
on Set Theory and the measure theory on sets. The following axioms have 
been identified as necessary and sufficient for probability P(.) as a measure: 
Axiom 1) (nonnegativity) 0 < P(A) < 1, Axiom 2) (unity) P(0) = 1, and 
Axiom 3) (finite additivity 1 ), if A l5 A 2 , . . ., A n are disjoint events, then 
P(Ai U A 2 U . . . U A n ) = Y2i= i P(Ai). Events which are subsets of the sam- 
ple space are put in one-to-one correspondence with propositions in belief 
fuction theory [29], pages 35-37 and that’s why we use indifferently the ter- 
minology set, event or proposition in this paper. The probabilistic inference 
is (usually) carried out by Bayes’ rule according to: 



VP,P(P)>0, P(Ai\B) 



P(A i n B) 

P(B) 



P(B\Ai)P(Ai) 



(i) 



where the sample space 0 has been partitioned into exhaustive and ex- 
clusive events Ai, A 2 , . . . , A n , i.e. such that A t A Aj = 0, (i ^ j) and 
Ai U A 2 U . . . U A n = 0; P(.) is an a priori probability measure defined 
on 0 satisfying Kolmogorov’s axioms. In Bayes formula, it is assumed that 
the denominator is strictly positive. A generalization of this rule has been 
proposed by Jeffrey [12, 13] for working in circumstances where the parochial- 
ist assumption is not a reasonable assumption, i.e. when P(B\B) = 1 is a 
fallacy, see [13, 22] for details and examples. 



1 Another axiom related to the countable additivity can be also considered as the fourth 
axiom of the probability theory. 
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Using the classical terminology adopted in belief function theories (DST 
and/or DSmT) and considering for example 0 = {A, B}, a discrete proba- 
bility measure P(-) can be interpreted as a specific Bayesian belief mass m(.) 
such that 

m(A) + m(B) = 1 (2) 

2.2. Dempster- Shafer Theory (DST) 

In DST [29], the frame of discernment 0 of the fusion problem under 
consideration consists in a discrete finite set of n exhaustive and exclusive 
elementary hypotheses 9,, i.e. 0 = {9i, 0 2 , . . ■ , 9 n }. This is called Shafer’s 
m.odel of the problem. Such model assumes that an ultimate refinement of 
the problem is possible, exists and is achievable, so that elements 9i, i = 
1,2 , ... ,n are well precisely defined and identified in such a way that we are 
sure that they are truly exclusive and exhaustive (closed- world assumption). 
The set of all subsets of 0 is called the power set of 0 and is denoted 2 e . 
Its cardinality is 2l 0 L Since 2 e is closed under U and all 0$, i = 1,2, ... ,n 
are exclusive, it defines a Boolean algebra. All composite propositions built 
from elements of 0 with U operator such that: 

1) 0A,....Ae 2 e ; 

2) If A, Be 2 e , then A U B e 2 e ; 

3) No other elements belong to 2 e , except those obtained by using rules 
1) or 2). 

Shafer defines a basic belief assignment (bba), also called mass function, as 
a mapping m(.) : 2 e — > [0, 1] satisfying m(0) = 0 and the normalization 
condition. Typically, when 0 = {A, B} and Shafer’s model holds, in DST 
one works with m(.) such that 

m(A) + m(B) + m(A U B) — 1 (3) 

m(A U B) allows us to commit some belief on the disjunction AU B which 
represents the ignorance in choosing between A and B. From this very sim- 
ple example, one sees clearly the ability of DST to offer a better modeling 
for a total ignorant /vacuous source of information by setting m(A U B) = 1, 
whereas in Probability Theory one would be forced to adopt the principle of 
insufficient reason (as known also as the principle of indifference) to justify 
taking m(A ) = m(B) = 1/2 as default belief mass for representing a total 
ignorant body of evidence. 
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In DST framework, the combination of two belief assignments mi(.) and 
m 2 (■) is done using Dempster’s rule of combination which can be seen as 
the normalized version of the conjunctive rule in order to remove the total 
conflicting mass and to get a proper normalized belief mass after the combi- 
nation [29]. Dempster’s rule is mathematically defined by m(0) = 0 and for 
X ^ 0 by 



m(X) 



E m 1 (X 1 )m 2 (X 2 ) 

Xi,v 2 e2 e 

x 1 nx 2 =x 

1 - E rn l (X 1 )m 2 (X 2 ) 

Vi,x 2 e2 e 

x 1 nx 2 =<H 



(4) 



Dempster’s formula is dehned if and only if the two sources of evidence 

are not fully conflicting; that is when x 2 e 2 ° m i(^i) rn 2 (X 2 ) ^ 1. 

Vi’nx 2 =0 



2.3. Dezert-Smarandache Theory (DSmT) 

In DSmT framework [31, 32, 33], the frame 0 = {$i, d 2 , . . . , 0 n } is a hnite 
set of n exhaustive elements which are not necessary exclusive. The prin- 
ciple of the third excluded middle and Shafer’s model are refuted in DSmT 
(but can be introduced if needed depending on the model of the frame one 
wants to deal with), since for a wide class of fusion problems, the nature of 
hypotheses can be only vague and imprecise or crude approximation of the 
reality and none ultimate refinement is achievable. As a simple example, if 
we consider two suspects Peter (P) and Mary (M) in some criminal investi- 
gations, it may be possible that Peter has committed the crime alone, as well 
as Mary, or maybe Peter and Mary have committed the crime together. In 
that case, one has to consider the possibility for P fl M ^ 0 but there is no 
way to refine the original frame 0 = {P, Ad} into a finer one with exclusive 
hirer elements say as 0' = {P \ (P D M), P fl M, M \ (P fl M)} because there 
is no physical meaning and no possible occurrence of the atomic granules 
P\(Pfl M) and M\(P fl M). In other words, the finer exclusive elements 
of the refined frame satisfying Shafer’s model cannot always be well identi- 
fied and precisely separated and they may have no sense at all. This is the 
main reason why DSmT allows as foundation the possibility to deal with non 
exclusive, partially overlapped or vague elements and refute Shafer’s model 
and third excluded middle assumptions. DSmT proposes to work on a fusion 
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space defined by Dedekind’s lattice also called hyper-power set D @ in DSmT. 

The hyper-power set is defined as the set of all composite propositions 
built from elements of 0 with D and U operators such that [5]: 

1) 0,0!,. ..A efl e ; 

2) If A, B G D e , then A U B G D 0 and A n B e D e ; 

3) No other elements belong to D & , except those obtained by using rules 
1) or 2). 

Following Shafer’s idea, Dezert and Smarandache define a (generalized) basic 
belief assignment (or mass) as a mapping m(.) : D & — > [0, 1] such that: 

m(0) = 0 and ^ m(X) = 1. 

xeD e 



Typically, when 0 = {A, B} and Shafer’s model doesn’t hold, in DSmT 
one works with m(.) such that 

m(A) + m(B) + m(A U B) + m(A nB) = l (5) 

which appears actually as a direct and natural mathematical extension of (2) 
and (3). 

Actually DSmT offers also the advantage to work with Shafer’s model 
or with any hybrid model if some integrity constraints between elements of 
the frame are known to be true and must be taken into account in the fu- 
sion process. DSmT allows to solve static and/or dynamic 2 fusion problems 
in the same general mathematical framework. For notation convenience, one 
denotes by G e the generalized fusion space or generalized power set including 
integrity constraints (i.e. exclusivity as well as possible non-existence restric- 
tions between some elements of 0), so that G e = D e when no constraint 
enters in the model, or G e = 2 e when one wants to work with Shafer’s 
model (see [31] for details and examples), or G e = 0 when working with 
probability model. If one wants to work with the space closed under union 
U, intersection D, and complementarity C operators, then G & = S e , i.e. the 
super-power set (see next section). A more general introduction of DSmT 



2 i.e. when the frame and/or its model change with time. 
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can be found in Chapter 1 of [33]. 



In DSmT, the fusion of two sources of evidences characterized by mj(.) 
and m 2 (.) is defined by riipcnsi®) = 0 and \/X e G e \ {0} 



’^PCRsi.^) ~ m 12(X) + 



£ i 

veG® 



mi(X) 2 m 2 (Y) 
mfiX) + m 2 (Y) 



xn y=0 



+ 



m 2 (X) 2 mi(Y) 
m 2 {X) + mi (Y) 1 J 



where all sets involved in formulas are in canonical form; m\ 2 (X) = m n (X) = 

ExiV 2eG e ^i(^i)^(X 2 ) corresponds to the conjunctive consensus on X 
x(nX 2 =X 

between the n — 2 sources and where all denominators are different from 
zero. If a denominator is zero, that fraction is discarded. A general formula 
of PCR5 for the fusion of n > 2 sources has been proposed in [32], 

2-4- Unification of Fusion Theory (UFT) 

Recently Smarandache has proposed in [30, 32] an extension of DSmT by 
considering a super-power set S e as the Boolean algebra on 0, i.e. S e = 
(0, fl, U, c(.)). In other words, S e is assumed to be closed under union U, 
intersection D, and complement c(.) of sets respectively. With respect to the 
partial ordering relation, the inclusion C, the minimum element is the empty 
set 0, and the maximal element is the total ignorance / = IJ]=i Since it 
extends the power set space through the closed operation of fl, U and c(.) 
operators, that is, UFT not only considers the non-exclusive situation among 
the elements, but also consider the exclusive, exhaustive, non-exhaustive 
situations, and even open and closed world. Typically, when 0 = {A, B} , 
in UFT one works with m(.) such that 

m(A) + m(B) + m(A fl B) + m(A U B) 

+ m(c(A)) + m(c(B )) + m(c(A ) U c{B)) = 1 (7) 



3. Evidence Support Measure of Similarity (ESMS) 

Definition 3.1. Let’s consider a discrete and finite frame 0 and the fusion 
space G e including integrity constraints of the model associated with 0. The 
infinite set of basic belief assignments defined on G e is denoted by m G e . An 
Evidence Support Measure of Similarity (ESMS) of two ( generalized ) basic 
belief assignments mfi.) and m 2 (.) in m G e is the functionSimf , .) : m G e x 
m G e — > [0, 1] satisfying the following conditions: 
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1) Symmetry: Vmi(.),m 2 (.) G m G e, Sim(mi, m 2 ) = Sim(rri 2 ,mi); 

2) Consistency: Vm(.) G m G e, Sim(m,m) = 1; 

3) Non-negativity: Vmi(.),m 2 (.) G m G e, Sim(mi, m 2 ) > 0 

We will say that rri 2 (.) is more similar to mi(.) than ms(.) if and only if 
Sim(mi,m 2 ) > Sim{mi,mf). The maximum degree of similarity is naturally 
obtained when both bba’s m\{.) and rri 2 (.) coincide, which is expressed by 
consistency condition 2). The equality = 0 must be obtained 

when bba’s have no focal elements in common, in particular whenever m i(.) is 
focused on X G G e , which is denoted rnf (.) and corresponds to mi(X) = 1, 
and m 2 (.) is focused on Y G G e , i.e. m 2 (.) = m\() such that rri 2 (Y) = 1, 
with X fl h = 0. 

Theorem 3.1. For any bba mi(.) G m G e (which is a |G H |- dimensional 
vector) and any small positive real number e, there exists at least one bba 
m 2 (.) G m G e for a given distance measure 3 d (., .) such that d(mi,m 2 ) < e. 

Proof: Let’s take m 2 (.) = mi(.), then d(mi, m 2 ) = d(mi,mi) = d(m 2 , m 2 ) = 
0 < e which completes the proof. 

Definition 3.2. (Agreement of evidence) : If there exist two basic belief 
assignments m\{.) and m 2 (.) in m G e snc/i that for some distance measure 
d(.,.), one has d(mi,m 2 ) < e with e > 0, f/ien e is ca//ed t/ie agreement 
of evidence supporting measure between mi(.) and m 2 (.) with respect to the 
chosen distance d{., .). mi(.) and m 2 (.) are said e-consistent with respect to 
the distance d (., .). 

Theorem 3.2. The smaller e > 0 is, ide closer the distance d(mi,m 2 ) be- 
tween mi(.) and m 2 (.) is, idai is, ide more similar or consistent m\{.) and 
m 2 (.) are. 

Proof: According to the Definition 3.2, if the evidence measure between 
mi(.) and m 2 (.) is e-consistent, then d(mi,m 2 ) < e. Let’s take e = 1 — 
Sim(mi, m 2 )] when e becomes smaller and smaller, Aim(mi,m 2 ) becomes 
greater and greater, according to the definition of ESMS and thus more simi- 
lar or consistent mi(.) and m 2 (.) become. Finally, if e = 1—Sim(mi,ni2) = 0, 



3 Here we don’t specify the distance measure and keep it only as a generic distance. Ac- 
tually d(., .) can be any distance measure. In practice, the Euclidean distance is frequently 
used. 
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then mi(.) and rri 2 (.) are totally consistent. 

From the previous dehnitions and theorems, the ESMS appears as an in- 
teresting measure for evaluating the degree of similarity between two sources. 
We propose to use ESMS in a pre-processing/thresholding technique in order 
to reduce the complexity of the combination of sources of evidence by keeping 
in the fusion process only the sources which are e-consistent. e is actually 
a threshold parameter which has to be tuned by the system designer and 
which depends on the application and computational resources. 



4. Several possible ESMS 

In this section we propose several possibilities for choosing an ESMS 
function Sim (., .) satisfying theorem 3.1. 



4-1- Euclidean ESMS function SimE(mi,m 2 ) 

Definition 4.1. Let 0 = {9i, . . . ,9 n } (n > 1), mi(.) and m 2 {.) in m G e , 
Xi the i-th ( generic ) element of G e and |G' e j the cardinality of G e . The 
following simple Euclidean ESMS function can be extended from [14]: 



SiniE(mi,m 2 ) = 1 




|G®| 



E ( m l ( X i) ~ m 2 ( X i)f 



( 8 ) 



The following theorem establishes that S , mB(mi,m 2 ) is an ESMS func- 
tion. 



Theorem 4.1. SimE(mi, m 2 ) defined in (8) is an ESMS function. 

Proof: 

1) Let’s prove that SimE(mi,m 2 ) G [0,1]. If SimE(mi,m 2 ) > 1, from 

(8) one would get (wi(-^») — m 2 (W)) 2 < 0 which is impos- 

sible, so that SimE(mi, m 2 ) < 1. Let’s prove SimE{mi,m 2 ) > 0 or 
equivalently from (8), ( m i (Xi) — m 2 (X i )) 2 < 2. This inequality 

is equivalent to E=i 1 ^i(^i) 2 + Z)[=i 1 rn 2 (Xi) 2 <2 + 2 1 
m 2 (Xj). We denote it (i) for short, (i) always holds because one has 

(ElS 1 ™.(V.) 2 +EE' m 2 (A'.) 2 ) < (lEE 1 m.(A.)] 2 +|EE' n> 2 (V)] 2 ) 

and thus 
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(EEi' m i (A) 2 + EEi' m 2 (Xi) 2 ) < 2 because EEi' m a (Xi )] 2 = 1 for 
s — 1,2 (m s (.) being normalized bba). Therefore inequality (i) holds 
and thus m 2 ) > 0. 

2) It is easy to check that SimE(mi,m 2 ) satisfies the first condition of 
Definition 3.1. 

3) If m\{.) = m 2 (.), then SimE(mi,m 2 ) = 1 because 

|G©| 

yE ( m l( X i) - m 2 {Xi)f = 0. 

1=1 

Thus the second condition of Definition 3.1 is also satisfied. 

4) Non-negativity has been proven above in the first part. Herein we use 
a particular case to show that Sim(ml, m2)=0, i.e. there exist rnf and 
mf for some A", Y e G e \ {0} such that X ^ Y, then according to (8), 
one gets EE? ( m i ( X i) - m 2 (Xi)) 2 = [mf(A)] 2 + [m\{Y)} 2 = 2 and 
thus one has SimE(mf ,rrt^) = 1 — (-s/2/\/2) = 0, so that SirriE (.,.) 
verifies the third condition of Definition 3.1. 

4-2. Jousselme ESMS function Simj(mi,m 2 ) 

Definition 4.2. Let mi(.) and m 2 {.) be two basic belief assignments in m G e 
provided by the sources of evidence Si and S 2 . Given a |G e | x G' e | assumed 4 
positive definite matrix D = [Aj], where Dij = (A* n Xj\/\Xi U Xj\, with 
Xi,Xj G G e . Then, Jousselme ESMS function can be redefined from the 
Jousselme et al. measure [14]: 

Simj(mi,m 2 ) = 1 -=\J (mi — m 2 ) T D(mi — m 2 ) (9) 

V2 

or equivalently 



Simj(mi,m 2 ) = 1 -j= 

v 2 



\j m\ + — 2{mi,m 2 ) 



4 Actually, Jousselme et al. in [14] did not prove that D = [Dij = \Xi fl Xj\/\Xi U Xj\] 
is truly a positive definite matrix. D is until now assumed to be positive definite. This is 
only a conjecture and proving it is not a trivial problem. 
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where (mi, m 2 ) is the scalar product defined as 

]G e | |G e | 

(mi,m 2 ) = EE D ij m 1 (X i )m 2 (X j ) 
i= 1 j= 1 

X i ,X j eG e ,i,j = l,...,s,\G% || m || 2 represents the squared norm of the 
vector (bba) m, i.e. ||m|| 2 = ( m,m ). 

Theorem 4.2. Simj(m\,m 2 ) defined in formula (9) is an E SMS function. 

Proof: 

1) Since the matrix D is conjectured to be a positive definite matrix, 
Simj(mi,m 2 ) satisfies the condition of symmetry. 

2) If mi is equal to m 2 , according to (9), one gets Simj(mi,m 2 ) = 1. In 
other hand, if Simj(mi,m 2 ) = 1, then the condition m \ = m 2 holds. 
That is, the condition of consistency is satisfied. 

3) According to (9), it can be drawn that Simj(mi,m 2 ) < SimE(m\, m 2 ), 
and since the minimum value of Simj(mi,m 2 ) is zero, then Simj(mi,m 2 ) 
is non-negative. 

4) According to the definition of Simj(mi, m 2 ), we can easily verify that 
Simj(mi, m 2 ) is a true distance measure between m\ and m 2 . 

Actually SimE(mi,m 2 ) is nothing but a special case of Simj(mi,m 2 ) 
when taking D as the |G H | x |G' H | identity matrix. 

4-3. Ordered ESMS function Simo(mi,m 2 ) 

The definition of this (partial) ordered-based ESMS function is similar 
to Simj(mi,m 2 ) but instead of using Jousselme’s matrix D = [D. y ], where 
Dij = \Xi n Xj\/\ Xi U Xj I, with Xi, Xj G G e , we choose the DSm matrix 
S = [Sij] where Sij = s(Ab n Xf)/s(X i U Xf). Therefore, one has 

Simo(mi,m 2 ) = 1 -=\J (m 1 — m 2 ) r S(mi — m 2 ) (10) 

V2 

The function s(X) corresponds to the intrinsic informational content of 
the proposition X defined in details in [31] (Chap. 3) which is used for 
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